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Abstract 


Six  sample  selection  schemes  with  seven  estimators  were  studied 
in  a  simulation  study  drawing  samples  of  20  units  from  eight  forestry 
populations.  Sampling  strategies  were  compared  on  the  basis  of  bias 
both  of  the  estimate  and  its  variance,  mean  square  error,  and  cover- 
age of  confidence  intervals.  Although  there  is  no  consistently  best 
sampling  strategy,  stratified  sampling  with  probabilities  proportion- 
al to  size  (spps),  stratified  sampling  proportional  to  the  cumulated 
values  of  x  (spscx),  and  purposive  sampling  from  the  cumulated  x- 
values  (pscx)  with  the  weighted  regression  estimator  YWR  performed 
well  for  most  situations  with  most  of  the  criteria.  However,  pscx 
showed  evidence  of  estimation  bias  in  several  instances.  No  reliable 
variance  estimator  was  found  for  most  sampling  strategies,  but  the 
bootstrap  variance  estimates  with  spps  tended  to  have  relatively  small 
bias  (<8.58%  bias  for  all  populations).  For  spps  sampling,  use  of 
classical  and  simple  jackknife  variance  estimates  but  not  the  simple 
bootstrap  estimates  yielded  reliable  coverage  probabilities  in  most 
populations.  For  many  sampling  strategies,  variance  estimation  bias 
of  up  to  20%  may  have  to  be  accepted.  The  jackknife  variance  esti- 
mator always  overestimates  the  true  variance.  It  is  clear  that  new,  reli- 
able variance  estimators  may  need  to  be  derived  for  many  sampling 
strategies. 
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Six  sample  selection  schemes  with  seven  estimators  were  studied 
in  a  simulation  study  drawing  samples  of  20  units  from  eight  forestry 
populations.  Although  there  is  no  consistently  best  sampling  strate- 
gy, stratified  sampling  with  probabilities  proportional  to  size  (spps), 
stratified  sampling  proportional  to  the  cumulated  values  of  x  (spscx), 
and  purposive  sampling  from  the  cumulated  x-values  (pscx)  with  the 
weighted  regression  estimator  YWR  performed  well  for  most  situa- 
tions with  most  of  the  criteria. 
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Model-Dependent  and  Design-Dependent 
Sampling  Procedures — A  Simulation  Study 

H.  T.  Schreuder,  H.  G.  Li,  and  G.  B.  Wood 


Management  Implications 

Inventories  are  expensive  but  are  usually  necessary  to 
achieve  management  objectives.  Some  new  sampling 
strategies,  which  include  both  design  and  weighted 
regression  estimators,  are  shown  to  be  highly  efficient 
in  estimating  parameters  of  interest.  It  is  shown  that 
model-based  but  probabilistic  procedures  do  as  well  or 
better  than  purposive  model-based  procedures.  Variance 
estimation  for  such  complex  strategies  is  not  yet  satis- 
fying but  more  precise  estimates  can  be  generated  even 
with  current  designs  with  little  or  no  cost  to  the  forest 
manager. 

Introduction 

Traditional  approaches  to  sampling  such  as  presented 
in  Cochran  (1977)  assume  that  sample  units  in  surveys 
are  selected  by  a  design-dependent  (DD)  technique,  such 
as  stratified  sampling,  probability  proportional  to  size 
(pps)  sampling,  or  simple  random  sampling  (SRS).  If  a 
linear  model  can  be  assumed  between  the  variable  of 
interest  and  a  covariate,  model-based  (MB)  (or  more 
descriptively  model-dependent  (MD))  sampling  may 
yield  more  efficient  but  perhaps  more  biased  estimates 
and  the  estimate  of  the  variance  may  be  unreliable.  The 
purpose  of  this  simulation  study  is  to  compare  tradi- 
tional sampling  strategies  with  some  MB  procedures  and 
some  restricted  representative  sampling  strategies  in 
terms  of  estimation  bias,  efficiency,  and  confidence  limit 
coverage  probabilities  for  a  variety  of  forestry  popula- 
tions. Reliability  of  the  variance  estimates  is  assessed. 


Review  of  Literature 

Linear  models  of  the  form 

yi  =  a-H  /3  xj  +  ej  [1] 

where 

E(ei|xi)  =  0,  V(ei|xj)  =  for  0  <  k  <  2 

E(eiejiXjXj)  =  0  for  i  ^  j 

are  recognized  as  being  appropriate  in  many  forest  men- 
surational  applications  (Spurr  1952)  where  y  is  the  vari- 
able of  interest  (say,  volume),  x  is  an  independent 
variable  (say,  D^H)  that  is  easier  to  measure  than  y  but 
usually  only  of  interest  because  of  interest  in  y,  and  e 
denotes  a  random  error.  Relationships  between  other 
variables  are  often  considered  to  be  either  curvilinear  or 
nonlinear  (say,  tree  height  vs.  tree  diameter  or  tree  height 
over  age  of  tree)  although  such  a  nonlinear  relationship 
may  not  exist  or  may  not  be  obvious  because  of  the  scat- 
ter of  the  data  (Schreuder  et  al.  1987). 


Sample  Selection 

When  there  is  a  strong  linear  relationship  between 
variables  y  and  x,  model-dependent  (MD)  procedures 
may  be  preferred  over  traditional  probabilistic  proce- 
dures. In  MD  procedures,  inference  is  dependent  on  an 
assumed  underlying  model  and  samples  may  be  drawn 
in  a  purposive  (nonprobabilistic)  manner  to  improve  the 
efficiency  of  estimates. 

There  is  no  easily  computed  optimal  sample  for  any 
k  >  0  and  a  ^  0  in  model  [1].  The  optimal  sample  is 
a  function  of  the  distribution  of  the  x-values  and  hence 
depends  on  the  population  under  consideration. 
Schreuder  and  Thomas  (1985),  Schreuder  and  Wood 
(1986),  and  Wood  and  Schreuder  (1986  and  references 
therein)  compared  MD  and  probabilistic  procedures  in 
simulation  and  actual  surveys,  primarily  for  timber 
volume  estimation.  They  generally  concluded  that  MD 
procedures  can  be  more  efficient  than  design-dependent 
(DD)  procedures,  but  a  big  part  of  the  improved  efficien- 
cy is  due  to  the  use  of  a  weighted  regression  estimator 
with  the  MD  rather  than  the  Horvitz-Thompson  ratio 
estimator  used  with  DD  procedures.  And,  although  the 
MD  procedures  with  the  regression  estimator  can  have 
relatively  small  estimation  bias  and  high  efficiency  for 
the  populations  studied,  it  is  not  clear  why  this  should 
generally  be  true  for  other  populations.  For  example,  the 
pscx  procedure  was  quite  efficient  for  three  timber  sale 
and  timber  plot  volume  populations  considered  in 
Schreuder  (1986)  with  the  Horvitz-Thompson  estimator. 
This  was  due  to  the  elimination  from  the  samples  of  the 
smallest  and  largest  x-values  (and  hence  the  extreme  y/x 
ratios)  for  the  ratio  estimator. 

Karmel  and  Jain  (1987)  compared  purposive  and  ran- 
dom sampling  procedures  to  estimate  capital  expend- 
iture. They  found  that  a  stratified  sample  consisting  of 
the  units  with  the  largest  x-values  in  each  stratum  with 
a  simple  ratio  estimator  was  by  far  the  most  efficient  sam- 
pling strategy.  And  they  managed  to  obtain  a  meaning- 
ful estimate  of  error  for  this  strategy. 

Herson  (1976)  noted  that  if 

Yk  =  /5iXk  +  Mxk)^^^  k  =  1,2,...,N 

where  e;j,e2,...,eivj  are  independent  random  variables 
with  mean  0,  variance  a^,  then  the  ratio-of-means  esti- 
mator for  the  population  total 

n  n 

t  =  (  E  yu/  E  Xk)XT^ 
k  =  i      k=i  ^ 

is  the  best  linear  unbiased  (BLU)  predictor  of  Y-p  with 

MSE(t)  =  E(t  -  Yt)2  =  a2[(x.j,  _  nx)/nx]XT. 

Here  X-p  denotes  the  population  total  for  the  x-values. 
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This  estimator  (T)  is  also  the  BLU  predictor  for  for 
the  model 

Yk  =  /3o  +  l^i^k  +  1^2^  +  •■•  +  /5jxl  +  ek(Xk)i/2 

when  a  balanced  sample  is  selected.  A  balanced  sam- 
ple sQ)  is  one  for  which 

xO)  =  X(j),  j  =  1,2, ...J 

where 


cO)  =  ±  L  xiand  X(j)  = 
nk  =  i  ^ 


N 


N 

L 

k  =  i 


Royall  and  Herson  (1973a)  noted  that  random  sam- 
pling is  fundamentally  justified  in  that  it  provides  sam- 
ples that  are  approximately  balanced  (the  sample  means 
and  variances  are  about  the  same  as  the  corresponding 
population  mean  and  variance  for  the  covariate)  with 
respect  to  various  characteristics.  Experimenters  often 
reject  random  samples  that  appear  to  be  "extreme"  or 
"unrepresentative."  After  the  sample  is  chosen  and  the 
data  are  gathered,  adjustments  are  frequently  made  in 
calculations  if  the  randomization  process  fails  to  divide 
the  units  into  "sample"  and  "nonsample"  groups  that 
are  similar  with  respect  to  relevant  characteristics.  One 
familiar  technique  of  this  type  is  post-stratification.  This 
is  also  the  basic  rationale  behind  the  ratio-of-means  es- 
timator that  adjusts  the  sample  mean  y  by  the  ratio  ad- 
justment X/x. 

If  not  enough  is  known  about  the  relationship  between 
y  and  x  to  correct  for  imbalance  in  the  sample,  then  a 
sample  should  be  balanced  on  average,  with  respect  to 
X  such  as  in  random  sampling. 

lachan  (1985)  noted  that  the  ratio-of-means  estimator 
T  can  be  badly  biased.  For  example,  for  the  model-based 
expectation  (denoted  by  E^) 

E|  (yj)   =  ^0  +  ^i^^i  for  ^0^0 
and  Gov.  (yj,  yj)  =  a^x.  if  i  =  j 

=  0  if  i  ^  j, 
the  model  bias  of  T  becomes 

E^  (t  -  Y)  =  N  (3o  (X  -  x)/x, 

which  is  small  only  if  |(x  -  X)/|x|  is  small.  This  leads 
naturally  to  the  idea  of  balanced  sampling  such  that 
X  =  X. 

Royall  and  Herson  (1973b)  considered  stratification  on 
a  size  variable  as  an  additional  technique  to  balance  and 
protect  against  model  failure.  They  showed  that  stratifi- 
cation and  balanced  sampling  provide  more  protection 
for  efficient  estimation  than  balanced  sampling  alone 
using  ratio-of-means  estimators.  If  the  population  total  is 

H     Nh  H 

T  =  E    E  yuv  =  D  Tu 
h  =  ik  =  i  '^'^     h  =  i 

where  strata  are  defined  so  that  stratum  1  contains  the 
N-^  smallest  units,  stratum  2  the  next  smallest  units, 
etc.,  for  h  strata  and  a  sample  of  nj^  units  is  selected  from 
stratum  h  and  the  total  for  the  stratum  is  estimated 
by  the  ratio-of-means  estimator 


Yh.rm  =        Yhk/^  '^hk),  ^  ^hk' 
Sh         Sh  k=l 

where  s-^  is  the  sample  in  stratum  h,  then  the  overall  to- 
tal T  is  estimated  by  the  separate  ratio-of-means 
estimator 


t*=  i:  Y 

h=i 


h,rm  • 


This  estimator  can  be  generated  by  the  stratified  bal- 
anced sample  of  degree  J  denoted  by  s*(J)  where  xj^'  = 
X^j  for  j  =  1,2,...,J. 

The  main  results  of  Royall  and  Herson  (1973b)  are 
given  in  the  foDowing  two  theorems: 

—  1/2 

Theorem  1 :  If  Uh  is  proportional  to  then  the 

strategy  [s*  (J),  T*]  is  more  efficient  than 

H     Oh  H  Hh 

[sQ),  Y,^=  (  E     L  y^^/  L  j:  Xhk)XT] 
h=lk=l  h=lk=l 

under  model 

y   =  5q(3q  +  b-^f3^x  +  62^2       ■•"  •••  +  ^J^^J  x^  +  e 
where  V(e)  =  o^x.  That  is, 

E^[Y,^-T]2  >  E^(t*  -T)2. 

Theorem  2:  When  the  size  measures  x^,...,Xi«,j  all  have 
different  values  and  the  stratified  balanced  sampling 
strategy  [s*(J),  T*]  with  equal  allocation  is  employed 
then  the  necessary  conditions  for  optimal  stratification 
under  any  and  all  models  of  the  form 

y  =  8q(3q  +  8^^^x  +  ...  +  6j|3j  x?  +  e 

with  V(e)  =  a^x  are 

(1)  >  N2  >  ...  >  Nh 
and 

(2)  N-^  X^  <  N2X2  ^  ••■  ^  N^X^  if  compatible  with 
(1)  above. 

There  are  3  special  cases  for  which  both  (1)  and  (2) 
in  Theorem  2  are  satisfied: 

(1)  N,  =  N2  =  ...  =  Nh 

(2)  N^Xi  =  N2X2  =  ...  =  NhXh 

(3)  N^^X^  =  N22X2  =  ...  =  Nh^Xh- 

Heuristic  arguments,  preliminary  bits  of  empirical  evi- 
dence, and  theoretical  calculations  led  Royall  and  Her- 
son (1973b)  to  suggest  that  equal  allocation  in  (3)  above 
is  usually  close  to  the  optimal  allocation  scheme. 

Additional  Estimators 

Little  (1983)  discussed  the  merits  of  the  generalized 
regression  estimator 


Ygr=.2  nrMYi-Ui)     E  Ui 
1=1  1=1 


[2] 
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where  U;  =  ^Xj  (or  some  other  regression  estimate)  for 
the  sampled  and  nonsampled  units  and  IT;  is  the  prob- 
ability of  including  unit  i  in  the  sample.  For  example 
if  Xj  =  1  for  all  i,  then  ii:  can  be  a  (possibly  weighted) 
mean  of  the  sampled  values  of  y. 

This  estimator  is  asymptotically  design  unbiased. 
Sarndal  (1980)  proposed  calculating  ^  by  weighted  least 
squares  and  discussed  two  possibilities: 

(a)  n-inverse  weighting  in  which  each  unit  i  is  given 
a  weight  proportional  to  Ilj"^.  For  this  the  estima- 
tor is  called  Ypj  =  Yqj^. 

(b)  Best  linear  unbiased  weighting,  in  which  unit  i  is 
given  a  weight  kj"^  with  kj  proportional  to  the  re- 
sidual variance  of  y;  under  the  regression  model. 
For  this  the  estimator  is  Yg^j  =  Yqj^. 

Estimator  [2]  can  be  viewed  as  a  compromise  between 
model-based  and  design-based  inference.  The  second 
part  of  [2]  is  a  prediction  of  Y  from  the  regression  model. 
The  first  term  is  the  Horvitz-Thompson  estimator  applied 
to  the  residuals  Yj  -  iij  which  protects  the  estimator  from 
model  misspecification  by  ensuring  asymptotic  design 
consistency.  Sarndal  concluded  that  within  this  context, 
the  choice  of  best  linear  unbiased  weighting  to  calcu- 
late jS  seems  more  logical  than  H-inverse  weighting  be- 
cause the  former  yields  an  efficient  estimate  of  (3  under 
the  model.  The  best  linear  estimator  is  the  maximum 
likelihood  (ML)  estimator  under  the  normal  linear  regres- 
sion model  Y;  ~  N(/3^Xj,  k^a^)  where  N(a,  b)  denotes  the 
normal  distribution  with  mean  a,  variance  b,  and  in- 
dependent Yj's.  The  design  weights  yield  ML  estimates 
only  if  rij  is  proportional  to  kj. 

An  alternative  to  [2]  is  the  prediction  estimator  (PR) 


Y,-  + 


Ail. 

Its 


[3] 


which  is  the  sum  of  the  observed  and  predicted  values 
in  the  population. 

Little  (1983)  argued  that  estimators  of  Y-p  should  be 
chosen  from  the  class  of  estimators  [3]  rather  than  [2] 
because  PR  estimators  have  optimal  statistical  proper- 
ties if  the  model  is  true,  and  from  a  frequentist  perspec- 
tive, PR  estimators  from  least  squares  regression  have 
minimum  variance  among  unbiased  linear  estimators. 


Variance  Estimation 

Royall  and  Cumberland  (1978)  discussed  robust  vari- 
ance estimation  for  regression  and  ratio-of-means  esti- 
mators under  a  linear  regression  model.  For  the 
ratio-of-means  estimator 

Yrm  =  N(y/x)X 
the  weighted  least  squares  variance  estimator  is 

(N-n)  -  X2  " 


Vi(Yrm)  =  Vl(Yrm)  =  (N/n) 


X  ^   E  r?/x,. 


(n-1)      X  i  =  i  '  ' 
where  X2  is  the  mean  of  the  x-values  not  sampled  and 


Ti  =  Yi  -  (y/x)  xj. 

Robust  variance  estimators  are 


V2(Yrm)  =  Vh(Yrm)  =  (N/n)(N-n) 


[(n  -  1)(1  -  c^/n)]-\L^vf 

where 


X  Xo 


c2  =  (n  -  1)-VE  (X;  -  x)2/x2, 


i=  1 


V3(Yrm)  =  Vd(Yrm)  =  (N/n)(N-n] 


X2X 
nx^ 


r?[l-^]-i, 
1=1  '  nx 

and  the  standard  variance  estimator  (Cochran  1977)  is 

n 

V4(Yrm)  =  vJYrm)  =  [(N/n)(N-n)/(n-l)].i;^  rf. 

For  the  simple  linear  regression  estimator 
Yf,  =  N[y  +  m  -  X)] 

where 

^  =  .E  yi(Xi-x)/.L  (Xi-x)2, 
1  =  1  1=1 

the  weighted  least  squares  variance  estimator  is 

Vi(Yfr)  =  VL(Yf,)  =  V4(Yf,){l      (1  -  f)  A//d} 

and  robust  variance  estimators  are 

V2(Yf,)  =  Vh(Y,,)  =  (N  -  n)s2  +  ^      -  n)2 


rf[d  +  (Xi-x)AJ^ 


V3(Yfr)  =  VD(Yf,)  = 


(N-n) 


d2n(n-l) 


E   rf{[d  +  (Xi-x)Aj2+  ^d2}. 
i=l  1-f 

'      d(n-l)  ' 

and  the  standard  variance  estimator  (Cochran  1977)  is 


V4(Yf,)  =  vJY,,)  = 


N(N-n)  , 


where 


s2  =  E  rf/(n  -  2) 
i  =  1 

Ti  =  Yi  -  y  -  b(Xi  -  x) 
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n  n 
=  l-{.i:jd  +  (Xj-x)Aj2}-i  ^i;^  [d  +  (Xi-x)Aj2k. 

kj  =  [1  +  (xj  -  x)2/d]/n 
=  X2  -  X 

n 

d  =  .i:^(Xi  -  x)2/n. 

Royall  and  Cumberland's  (1978)  results,  based  on 
large  sample  theory,  suggest  that  the  variance  estima- 
tors V2  and  V3  may  be  more  robust  than  the  currently 
used  V|  and  V4.  The  weighted  least  squares  variance  es- 
timators may  not  be  robust  enough  in  many  applications. 
V2  and  V3  are  very  similar  to  the  jackknife  variance 
estimator. 

Royall  and  Cumberland  (1981b)  showed  that  under 
random  sampling  the  bias-robust  estimators  V2(Y^j.), 
V3(Yj>j.),  and  the  jackknife  variance  estimator  are  clear- 
ly superior  to  v^(Y^^)  and  V4(Yf  j,)  in  estimating  the  ac- 
tual mean  square  error  as  the  sample  mean  x  varies.  For 
restricted  random  sampling,  balanced  so  that  x  =  X, 
they  found  that 

V4(Y,,)  =  V2(Yf,)  =  v,(Y,,)  <   V3(Y,,)  <  Vj(Y,,) 

where  the  last  one  is  the  average  jackknife  variance  es- 
timator. For  one  population  Vj  was  close  to  the  actual 
mean  square  error.  Vj  appeared  to  be  one  of  the  best  es- 
timators for  the  other  populations  too. 

Royall  and  Cumberland  (1981a)  showed  that  the  esti- 
mators V2  (Yjyy[),  V3(Yfyy[),  and  the  jackknife  variance  es- 
timator are  clearly  superior  to  the  weighted  least  squares 
estimator  v^(Yj^|^)  for  six  study  populations.  They  men- 
tioned that  the  standard  variance  estimator  V4(Yj^)  can 
be  badly  biased  and  proposed  that 

V2(Yrm)   =  Vh(Yrm)  =  V4(Yrm)(X2  X/x2)(1  -  V^/u)"^ 

where 

v2  =  .L^(Xi-x)2/{x2(n-l)} 

be  used  to  adjust  for  this  bias  in  V4(Y|^j^).  This  bias 
problem  with  V4(Yj^i^)  may  be  the  reason  why  it  is  not 
mentioned  further  in  the  results. 

Schreuder  and  Anderson  (1984)  compared  variance 
estimators  for  estimators  of  total  volume  for  2  popula- 
tions of  loblolly  pine  trees  using  Y^r,  the  simple  linear 
(k  =  0)  regression  estimator,  and  Y^j.,  the  weighted  lin- 
ear (k  =  1.5)  regression  estimator.  A  linear  model  with 
k  =  1.5  is  known  to  be  very  reasonable  for  these  pop- 
ulations. They  used  estimators  v^[Y(^),  V2[^(^],  V3(Y^j.), 
V4(Y^r)'  VT(Yfr).  Vj(Y^j,),  and  v^(Y^,)  where  the  latter  is 
the  weighted  regression  estimator  with  k  =  1.5.  Sam- 
ples of  size  20,  40,  60,  and  80  were  used  in  the  simula- 
tion study.  They  recommended  the  use  of  Vj(Y^j.)  for 
Y^j,.  The  jackknife  variance  estimator  Vj(Yfj,)  was  less 
appropriate  for  the  less  appropriate  estimator  Y(^.  For 
Y^j.,  V2(Y^i,)  was  less  reliable  and  VgCY^j.)  was  totally 
unreliable. 


Efron  (1982)  discussed  an  alternative  to  jackknifing, 
called  bootstrapping.  This  method  is  simpler  to  imple- 
ment and  requires  fewer  assumptions  although  it  re- 
quires more  computation.  In  bootstrapping,  Ug  samples 
of  size  n  are  randomly  selected  with  replacement  and 
the  same  estimator  is  used  as  with  the  sampling  strate- 
gy in  question.  The  Ug  estimates  are  then  treated  as  in- 
dependent sample  estimates  and  their  variance  yields 
an  estimate  of  the  variance  of  their  average  value  (the 
bootstrap  estimate)  as  well  as  for  the  original  estimate. 


Objectives 

1.  To  compare  simple  random  sampling  (SRS),  re- 
stricted (balanced)  simple  random  sampling  (RSRS), 
sampling  with  probability  proportional  to  size  (pps), 
stratified  sampling  with  probability  proportional  to  size 
(spps),  purposive  sampling  from  the  cumulated  x-values 
(pscx),  and  stratified  sampling  from  the  cumulated  x- 
values  (spscx)  sampling  designs  with  several  ratio  and 
regression  estimators  in  terms  of  bias,  variance,  and 
mean  square  error  for  a  wide  range  of  populations  from 
actual  forestry  data  sets.  These  contain  sets  with  known 
linear  y-x  relationships  as  well  as  others  where  the  ac- 
tual y-x  relationship  is  more  nebulous.  Other  criteria  to 
be  considered  are  reliability  of  variance  estimator  and 
coverage  rates. 

2.  To  determine  if  at  least  one  reliable  variance  esti- 
mator for  each  sampling  strategy  (design  and  estimator) 
can  be  found. 


Criteria  for  Evaluation  of  Sampling  Strategies 

a.  Average  estimator  bias  in  percent  of  the  true  total, 

Percent  Bias  =  [(Y  -  Y)/Y]  *100%. 

b.  Average  iterated  variance  Vj  for  each  sampling 
strategy. 

c.  Mean  square  error  (MSE)  =  Vj  -H  bias^  for  each 
sampling  strategy,  where  bias  =  Y  -  Y,  and  Vj  = 
10000 

^    (Yi,-Y)2/(10000-1),  Yj.  is  the  estimate  at  the 

k=  1 

k-th  iteration. 

d.  Reliability  of  variance  estimators  expressed  as  a 
percentage  of  the  best  measure  of  the  true  variance, 

Vi; 

i.e.,  [(V  -  Vi)/Vi]  *100%, 

where  V  is  the  average  of  variance  estimates  from 
10,000  iterations. 

e.  Coverage  probability  of  95%  confidence  intervals; 
i.e.,  for  targeted  95%  confidence  intervals,  what 
percentage  actually  contains  the  parameter  of 
interest? 
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Methods 

Sampling  Designs  to  be  Tested 

1.  SRS. — All  units  have  equal  probabilities  of  selection 
and  each  pair  of  units  has  equal  joint  probabilities 

of  selection. 

2 .  RSRS .  — Units  are  selected  that  are  balanced  on  both 
the  mean  and  variance  of  x.  This  is  the  same  as  SRS, 
except  that  samples  with  mean  and  variance  of  x 
more  than  a  certain  percentage  away  from  the  popu- 
lation mean  and  variance  of  x  are  rejected.  lachan 
(1985)  pointed  out  that  there  will  be  sample  units 
with  zero  probabilities  of  selection,  and  he  gave 
modifications  to  avoid  this.  These,  however,  are  cum- 
bersome to  implement,  so  they  were  not  considered 
here. 

3 .  PPS . — Units  are  selected  with  probability  proportion- 
al to  size  X  by  sorting  the  population  by  x  and  sam- 
pling without  replacement  from  the  cumulated 
x-values. 

4.  SPPS. — Units  are  selected  such  that  there  are 
H  =  n/2  strata  with 


The  strata  are  set  up  such  that 

Max  {|Nf  Xj  -  Nf  Xjl} 

is  minimized  for  all  i  and  j.  An  algorithm  was  writ- 
ten to  set  up  the  strata  accordingly.  Two  units  are 
selected  within  each  of  the  H  strata  by  using  pps  sam- 
pling within  each  stratum. 

5 .  PSCX .  — Divide  the  population  sorted  by  x-values  into 
n  classes  [iX-j./n,  (i  -I-  l)Xj/n],  i  =  0,l,...,n  -  1  and 
calculate  the  number  of  units  Nj  in  each  class. 
Within  each  class  select  the  first  unit  that  is  within 
p  =  5%  of  X-p/nNj  (if  no  unit  satisfies  this  criterion, 
p  is  boosted  to  10%,  15%,  etc.,  until  a  unit  is  se- 
lected). Here  Xj  is  the  sum  of  the  x-values  in  the 
population  and  the  units  are  randomly  visited. 

6.  SPSCX.— Randomly  select  1  unit  within  each  of  the 
n  classes  set  up  for  pscx  sampling.  This  is  stratified 
sampling,  selecting  1  unit  per  stratum. 


where  f=n/N,  dj  =  yi-bxi,  and  b  =  (yj/nxj). 

This  is  the  standard  pps  with  replacement  vari- 
ance estimator  which  should  be  satisfactory  for 
without  replacement  sampling  for  the  large  for- 
estry populations  usually  of  interest.  For  the 
stratified  pps  procedures,  the  standard  pps  with 
replacement  variance  estimator 

H  2 

V2  (Yht)        [.^  (yAi)'x^t  -  2  Yht']/2  [6] 

h  =  1  1  =  1 

was  used. 


b.  v,(Yht)  =  (1  -  f)[NX2/f]  E  Jdi/Xi)2/(n  -  1)  [7] 

Vj  is  also  defined  in  equation  [19]. 

This  is  the  jackknife  variance  estimator  Vj,  which 
is  the  pps  with  replacement  variance  estimator  v^ 
(Yp^j)  multiplied  by  the  correction  term  (1  -  f).  It 
is  recommended  for  Yp^-p  by  Cumberland  and 
Royall  (1981).  This  may  perform  better  than 
v^(Yj^'p)  for  small  populations. 

c.  Vg(Yj^'p)  is  shown  in  equation  [21]. 


2-        =  [J^yi/S  x^IXt 


[8] 


a-    Vh  (Yrm)  =  (N/n)  (N  -  n) 

n 

[(n  -  1)  (1  -  c2/n)]-\i;^rf , 


Xx. 


[9] 


where  c^  =  (n-l)-\E  {x^-xf/x^,  ri  =  ^r  (y/xjxj, 


1  =  1 


and  Xg  is  the  sample  mean  of  the  nonsampled 
units. 

b.  Vj(Yj^i^)  is  defined  in  equation  [19]. 


Estimators  to  be  Tested 


1.  Yht  =.f^(yi/nxi)  X^  [4] 

or  for  the  stratified  pps  procedures  (spps) 
H  2 

^HT  =^E^  .i;^(yhi/2Xhi)Xht, 

where  Xj^j  =  total  for  x  in  stratum  h. 
Variance  estimators  are 


a.  V,  (Yht)  =  (NX^/f)  .E^  (di/xi)2/(n-l)  [5] 


^-    ^b(Yrm)  = 


iti(YBi- V 
(nn  -  1) 


or  as  defined  in  equation  [21]. 
3.  Ylr  =  N  [a      13  X] 
with  variance  estimators 


a.  v(Ylr)  = 


N2(1-  ^) 


[E(y:-y)2  - 


n(n-2) 
{2:(yi-y)  (Xi-x)}2 


[10] 


[11] 


2(Xi-x)^ 

which  is  the  standard  that  is  used  for  compari- 
son purposes  (Cochran  1977). 


5 


b.  Vh(Ylr)  =  (N  -  n)s2  +  (N  -  n)^ 


[12] 


where  d  =  D  (x-  -  x)  /n, 
i=  1 

Ax  =  X2  -  X, 


-1 


=  1-  {.i:  [d  +  (Xi-x)Aj2} 


2\-l 


1=1 


.Ejd  +  (Xi-x)Aj2kj 

where  kj  =  [1  +  (xj  -  x)^/d]/n. 

c.  Vj  (Yj-j.)  is  defined  in  equation  [19]. 

d.  Vg  (Y^j.)  is  defined  in  equation  [21]. 


4.  Yblu  =  [ 


1.5 


+  /5rt,ii(X 


i  =  1 


BLU  -  L  T  MBLU*.--  n 

L  1/x/-^  L  l/x,i-5 


i  =  1 
n 


i=l 


+  .i;^Di/Nni]*N 


[13] 


where 


1=1  1=1 


^BLU  -  E  E 


i=l  i=l 


E  l/x.1-5  Ex2/x.^-^-  (  EXi/Xi^-5)2 

i=l      ^     i=i   ^    ^  i=i 


and 


1.5 


Di  =  Yi  

1      J'  1  n 


i  =  1 


i=  1  ' 

(Sarndal  1980) 
with 

a.  Vj  (Ygnj]  as  given  in  equation  [19] 

b.  Vg(YgLu)      given  in  equation  [21] 

c.  v^(Y3nj)  as  given  in  equation  [17]. 

n 

5.  YpR  =  [.L  Yj  +  E  /IJ  =  ny  +  E  /Ij 

1  =  1         les  Its 

where  /xj  =  a  +  |Sxj 

where  ^  =  ^qlu 

and  a  =  agLU  =  Y  -  ^BLU 

with 

a.  Vj(Ypj^)  as  given  in  equation  [19] 


[14] 


b.  Vg(Ypj^}  as  given  in  equation  [21] 

c.  v^(Ypj^)  as  given  in  equation  [17].  We  will  use 
k  =  1.5  at  this  time. 

6.  YwR  =  N[a^  +  ^^X]  [15] 
with 

and 

.|l/x>.Ey.xJ-k_|^^l-k|^y.^-k 
i  =  l         i=l  '  i=l  ^ 

Y^  and  YgL^      ^h^  same  except  that  Ygnj  contains 

n 

the  error  correction  L  D^/nll:  with  variance  estimators 
i=i  * 

a.  V  (Yy^)  as  defined  in  equation  [17]  with  = 

b.  Vj  (Y^)  as  defined  in  equation  [19] 

c.  Vg(Y^j^)  as  defined  in  equation  [21]. 


I  yj/n. 


Z  x,/n, 


i  =  1 

7-    Ypi  =  [  V  

L  i/n,. 

i=  1 

(Sarndal  1980) 
with 


i  =  1 


+  /3pi(X  ^ 


]]  *  N  [16] 


L  i/n,- 


i=  1 


n  n  n  n 

L  i/n,-  L  Xiy/n,.-  i:  y:/ni  E  xj/nj 

i=i     ^i=i        1  1=1-'^    M=i  '  ^ 

E  i/n,  E  x?/n.  -  (  £  x:/n, 


Here  Ilj  is  the  probability  of  selecting  unit  i  in  the 
sample,  which  is  taken  to  be 

=  n/N    for  SRS  and  RSRS 

=  1/Ni   for  spscs  where  Nj  =  the  number  of  units 

in  the  n  strata  set  up 
=  nxj/Xffor  pps  and  spps,  where      =  X-p  for  pps, 

Xt  =  Xht  for  spps  (h  =  l,...,  H). 

Clearly  the  probabilities  used  are  only  approximations 
for  RSRS. 

a.  with  Vyj  (Ypj)  as  defined  in  equation  [17] 

b.  with  Vj  (Ypj)  as  defined  in  equation  [19] 

c.  with  Vg  [Ypj)  as  defined  in  equation  [21]. 

The  variance  formulas  for  Ypj,  YgLu-  ^PR 
quite  messy.  Sarndal  (1982)  proposes  a  variance  esti- 
mator for  generalized  regression  estimators  of  linear 
functions  (Schreuder  et  al.  1987): 

v(Ygr)  =  s2[Enj-£ni  +  {[( J  xf/ni)(N  -  n) 
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-2[Xj  -  n  x)  E^(Xi/ni)]  (N  -  n)/ E^l/Hj 

+  (XT-nx)2]/SS,^}  [17] 
where  GR  =  PI  or  BLU  or  PR  and 

s2  =  [J^yf/n;  -  (J^Yi/n// L  l/Hi  -  i32sS^J/(n  -  2) 
with  iS  being  either  /Spj,  i8blU'  °^  '^PR 

n  n  n 

'^'^        i=l    '        '         1=1    '  i=l  ' 

with  11;  =  the  probability  of  selecting  unit  i  in  a  sam- 
ple of  n  units  for  v(Ypj)  and  U.^  =  Xj  ,  where  we  use 
k  =  1.5  in  this  study  for  v(Y3lu)  and  v(Ypj^). 

Proposed  General  Robust  Variance  Estimators 

Both  the  jackknife  and  the  bootstrap  variance  esti- 
mators seem  to  be  promising  robust  variance  estimators 
for  complex  sampling  strategies  (Efron  1982)  and  hence 
were  tested  in  this  simulation  study. 

The  jackknife  estimator,  generally  defined,  is 


i  =  1 


[18] 


where  n  is  the  sample  size  and 

Y,i  =  n  Y  -  (n  -  l)Y(i) 

where  Y  is  the  standard  estimator  used  and  Y^^  is  the 
standard  estimator  with  the  i-th  unit  deleted.  Then 

=  (Yji  -  Yj)2/n(n  -  1).  [19] 


The  bootstrap  estimator  is 


Yb  =  VnB- 


[20] 


where  Ug  =  number  of  bootstrap  samples  selected,  and 
Yg^  is  the  i-th  bootstrap  estimate.  This  estimator  has 


variance 


v(Yb]  =  .E^(YBi-YB)MnB-l). 


[21] 


Weighted  jackknife  and  bootstrap  variance  estimators 
were  not  considered  because  the  theory  was  not  suffi- 
ciently advanced  at  the  time. 


ping  takes  considerable  computing  time  so  only  2,000 
samples  are  selected  for  bootstrapping.  Jackknifing  was 
done  by  simply  deleting  each  of  the  20  sample  units  one 
at  a  time,  generating  20  pseudo-estimates  for  each  of  the 
7  estimators,  and  computing  the  variance  between  them 
using  equation  [19]. 

In  bootstrapping  for  regression  the  following  approach 
was  used  for  each  sample  selection  method: 

1.  The  regression  coefficients  a  and  |S  were  computed. 
In  the  case  of  Ypj, 


«PI  = 


(.£  Y^/U^ViL  l/(ni)-^pi(.E  xj/nj)/ £  l/Hi 
1  =  1  1  =  1  1  =  1  1=1 


2.  e;  =  yj  -  apj  -  jSpj  Xj  for  all  n  values  were  computed. 

3.  n  values  were  drawn  with  replacement  from  the 
ej-values  and  added  to  yj  =  api  +  i8pi  Xj  so  that  n 
values  y;  =  yj  +  e^  were  obtained. 

4.  apj  and  /Spj  for  the  n  values  of  yj*,  X;  were  computed. 

5.  Ypi(B)  =  N  (aPi  -1-  iSPi  X)  was  computed. 

Then  the  variance  between  the  Ug,  estimates  Ypj  (B) 
yielded  the  bootstrap  variance  for  Ypj.  The  bootstrap- 
ping technique  described  above  is  for  simple  random 
sampling.  For  other  sampling  designs  the  probability  of 
selection  for  bootstrapping  should  mimic  the  probabil- 
ity of  selection  for  the  original  sample. 

Similarly  for  Yblu 

«BLU  =  C  J^YiV-'  ]/El/X;^-^ 


-h^BLulJ  Di/(N  Hi)  -  (J  x/x>5  Viiyx,'-^  )} 
and  for  Ypj^  we  used 

«PR  =  "BLU 
I^PR  -  ^BLU  ■ 

For  each  estimator  we  computed  both  the  average 
bootstrap  estimate  (i.e.,  Yg^y  (B))  and  its  average  vari- 
ance estimate. 

For  SRS,  RSRS,  pscx,  and  spscx,  we  randomly  se- 
lected n  units  out  of  n  units  with  replacement  and  com- 
puted for  the  b-th  sample  (b  =  1,  Ug): 


Yht   =  (.i:^yi/nxi)XT 


Simulation  Statistics 

For  each  of  the  6  sample  designs  10,000  samples  of 
20  units  were  selected.  For  each  sample  the  7  estimates 
and  the  variance  estimators  were  computed.  This  was 
done  for  each  population.  Bootstrapping  with  Ug  =  50 
was  done  for  each  sample  as  described  below.  Bootstrap- 


Yrm'  =  (J,yi*/J,'^i*)XT 

and  then  computed  the  variability  between  the  ng  esti- 
mates using  equation  [21].  For  each  estimator  we  were 
interested  in  both  the  average  bootstrap  estimate  (i.e., 
Yht        ^^'^      average  variance  estimate. 
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For  pps  we  did  the  following: 

1.  Drew  n  units  with  pps  with  replacement  from  the  n 
sample  units  (cumulated  the  x-values  in  the  original 
sample;  selected  n  random  numbers  between  1  and 
the  sum  of  the  x's  and  selected  the  units  correspond- 
ing to  the  selected  random  numbers). 

n 

2.  Computed  Yj^j^  =  D^(yj*/n  Xj*)X'p 
and 

Yrm'  =  c£  y;/J^xr)XT 

and  computed  the  variability  between  estimates 
using  equation  [21].  For  spps,  we  drew  2  units  by  pps 
with  replacement  from  the  2  units  in  each  stratum 
and  computed 

and 

Rao  and  Wu  (1988)  point  out  that  such  nonlinear  es- 
timators as  the  ratio  estimators  are  not  consistent  esti- 
mators of  the  true  variances  of  Yj^-p  (and  Yj^jy[).  They 
suggest  as  alternatives  for  the  following  estimators 
that  do  yield  consistent  variance  estimators  of  the  true 
variances  of  Y^^j.  To  do  this  we  need  estimated  totals 
for  each  stratum  for  the  original  sample;  i.e. ,  for  stratum 
h: 

YHT(h)  =  .i;jyhi/2  Xhi)Xht 

where  Xj^^  is  the  total  for  stratum  h. 

Then  for  the  with-replacement  sample  they  compute 

Yht[rw)  =^^^%.  b  =  1,...,B 

where  for  the  two  units  in  each  stratum 

Zh  =  YHT(h)  +V2{(YHT*(h)  -  YHT(h)} 

and  2 

YHT(h)  =  .i:^(yhi/2Xhi)Xht 

Then  compute  the  bootstrap  variances  of  Y^j  (RW) 
using  equation  [21].  We  are  interested  in  the  average 
bootstrap  variance  estimates. 

Populations 

1 .  Simulated  loblolly  pine  population.— This  data  set 
of  1,795  trees  was  generated  from  the  loblolly  pine 
volume  data  set  (population  2)  (6  trees  with  negative 


values  for  y  were  eliminated).  The  variables  are  total  bole 
volume  in  cubic  feet  (v^,)  and  tree  d.b.h.  in  inches 
squared  times  total  height  (D^H)  in  feet.  For  each  tree 
the  value  of  v^,  was  computed  from  the  following 
equation: 

Vp  =  -0.734  +  0.002129  D^H  -I-  e 
with  residual  error  e  generated  from 

N(0,  a^iB^H)^-^), 

where  =  0.00001.  The  regression  coefficients  were 
obtained  by  fitting  weighted  regression  (k  =  1.5)  to  the 
loblolly  pine  volume  data  (see  population  2).  The  popu- 
lation skewness  of  D^H  is  2.552,  and  the  kurtosis  is 
8.047.  This  data  set  was  used  to  check  how  the  estimates 
perform  with  normally  distributed  errors. 

2.  Loblolly  pine  population. — This  data  set  consists 
of  N  =  1,801  Pinus  taeda  trees  with  measurement 
Vj,  =  total  bole  volume  (cubic  feet),  D  =  diameter  breast 
height  at  4  feet  5  inches  (in  inches)  and  total  height  (in 
feet).  The  data  set  was  selected  from  the  large  data  set 
discussed  in  McClure  et  al.  (1983)  so  as  to  reflect  the  ac- 
tual diameter  distribution  in  the  Coastal  Plains  of  the 
southeastern  United  States  (McClure  et  al.  1987).  The 
weighted  regression  relationship  for  the  data  set  that 
describes  the  actual  relationship  quite  well  is 

v^  =  -0.734  +  0.002129  D^H  -I-  e 

with  v(e)  =  (D^H)*^,  where  e  =  residual  error, 
k  =  1.5  =  a  constant  of  heterogeneity,  and  =  0.98 
for  simple  linear  regression.  The  population  skewness 
of  D^H  is  2.556,  and  the  kurtosis  is  8.069.  This  data  set 
was  also  used  in  Schreuder  and  Thomas  (1985). 

3.  Pinus  radiata  permanent  yield  plot  data,  Kowen 
Forest,  Australia  Capital  Territory. — The  variables  are 
merchantable  volume  (in  m'^)  and  D^H  (in  cm^m).  This 
population  comprises  47  plots  and  2,761  trees.  Plots 
were  randomly  located  within  site  index  classes  within 
compartments  in  1955  and  remeasured  in  1958,  1962, 
and  1971.  Some  plots  were  measured  at  other  times  for 
special  purposes.  Plot  size  is  2  by  1  chain  or  0.08  ha. 

The  diameter  of  each  stem  was  measured  at  breast 
height  (4  feet  3  inches)  and  four  bark  thickness  meas- 
urements were  taken  at  5  feet  and  15  feet,  to  derive  un- 
derbark  taper.  Total  height  was  measured  by  Haga 
altimeter. 

Merchantable  volume  of  the  trees  was  derived  from 
a  4-variable  tree  volume  table  for  Pinus  radiata  (Lewis 
et  al.  1973).  Volume  to  a  10-inch  diameter  inside  bark 
(Vjij)  in  1955  and  D^H  in  1955  were  used  as  variables. 
The  weighted  regression  (k  =  1.5)  was 

=  -0.02085  -I-  0.0000258  D^H 

and  the  simple  regression  was 

Vjb  =  -0.0177  -I-  0.0000254  D^H  (R^  =  0.9517). 

The  weighted  regression  describes  the  actual  relation- 
ships of  the  variables  quite  well.  The  population  skew- 
ness of  D^H  is  2.088,  and  the  kurtosis  is  5.278. 

4 .  Southern  imdisturbed  remeasurement  plots .  — This 
data  set  consists  of  275  remeasured  undisturbed  forest 
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plots  in  the  southern  United  States.  Net  plot  cubic  foot 
volumes  at  times  t(Vf)  and  t  -5  (V(_5]  are  used.  Plots  were 
measured  5  years  apart.  The  weighted  regression 
(k  =  1.5)  is 

v^  =  206.37  +  0.812  V,_5 

and  the  simple  linear  regression  is 

Vf  =  141.55  +  0.863  Vt_5 

(R^  =  0.73  for  simple  linear  regression).  This  is  a  good 
description  of  the  data  although  whether  k  =  1.5  can- 
not be  verified  reliably  for  such  a  small  data  set.  The 
population  skewness  of  v^_^  is  1.331,  and  the  kurtosis 
is  5.482. 

5.  Pine  Creek  periodic  management  inventory  data, 
Australia  (N  =  2,748). — Tree  diameters  [in  cm)  meas- 
ured in  1979  (D79)  and  in  1967  (D67)  are  used  as  varia- 
bles. From  1958  to  1961,  114  5-  by  1-chain  (0.5-acre  or 
0.202-ha)  rectangular  plots  were  located  at  random  with- 
in a  native  eucalypt  forest.  Plots  were  established  in  each 
of  43  compartments  depending  on  the  size  of  the  com- 
partment. Two,  three,  or  four  plots  were  distributed 
within  a  compartment  according  to  size  of  the  compart- 
ment and  the  relative  area  of  forest  types  so  that  the  units 
were  allocated  roughly  proportional  to  the  area  by  types. 
Seventy-one  plots  were  established  in  the  "ridge  type" 
(blackbutt  and  mixed  hardwood  types)  and  43  in  the 
"gully  type"  (the  flooded  gum  type).  One  of  the  gully 
plots  was  destroyed  in  1962. 

The  remaining  plots  were  remeasured  in  1963  and 
every  1-3  years  thereafter  up  to  1981  or  until  destroyed 
or  discarded.  The  variables  and  their  weighted  and  sim- 
ple linear  regression  are 

D79  =  1.906  -I-  1.089  D67  for  weighted  regression  and 
D79  =  3.246  +  1.052  D67  for  simple  linear  regression 
(R^  =  0.9463).  The  population  skewness  of  D67  is 
2.085,  and  the  kurtosis  is  7.250. 

6.  Alabama  remeasurement  data. — The  data  set  con- 
sists of  1,905  remeasured  undisturbed  volume  plots  in 
Alabama.  Plot  cubic  foot  volume  measured  in  1982 
(^1982^  1973  (vjgjg)  were  used.  The  weighted  regres- 
sion relationship  is 

v^gg2  =  639.78  +  0.829  v^gyg       fork  =  1.5. 

There  is  a  definite  linear  relationship  between  the  vari- 
ables but  it  is  not  clear  that  there  is  heterogeneity  of  var- 
iance. The  simple  linear  regression  is 

^1982  =  632.39  -I-  0.856  v^973  (k  =  0) 

[R^  =  0.5831  for  simple  linear  regression). 

The  population  skewness  of  v^gj^  is  1.327,  and  the 
kurtosis  is  2.515.  This  data  set  was  used  in  Schreuder 
and  Thomas  (1985)  too. 

7.  Pinus  radiata  bark  data,  Australian  Capital  Ter- 
ritory (A.C.T.),  Australia. — Relative  bark  thickness 
(RBT)  (ratio  of  bark  thickness  at  a  point  on  the  bole  to 
the  bark  thickness  at  breast  height)  and  relative  diameter 
to  the  fourth  power  (RD'*)  (ratio  of  diameter  over  bark 
at  the  same  point  on  the  bole  to  d.b.h.  outside  bark)  were 
used  as  variables.  The  data  comprise  measurements 
made  by  diameter  tape  both  outside  and  inside  bark  at 


1-  to  2-m  intervals  along  the  boles  of  446  trees  ranging 
in  age  from  18  to  45  years  (number  of  measurements 
=  5,142). 

The  data  were  derived  for  the  purpose  of  developing 
tree  volume,  taper,  and  bark  thickness  functions  for  radi- 
ata pine  in  the  A.C.T.  and  derived  from  plantation 
forests.  The  weighted  linear  regression  is 

RBT  =  0.1046  +  1.1060  RD^ 

and  simple  linear  regression  is 

RBT  =  0.1316  +  0.7904  RD^  (R^  =  0.8817). 

The  population  skewness  of  RD"*  is  1.355,  and  the  kur- 
tosis is  1.130. 

8.  New  York  (NY)  remeasurement  data. — The  data 
set  consists  of  622  remeasured  volume  plots  in  New 
York.  It  excludes  observations  with  zero  values.  The 
variables  are  gross  cubic  foot  volume  per  acre  of  live  trees 
(GAL)  at  times  t  and  t-1.  The  weighted  regression  is 

GAL^  =  698.7566  +  0.8802059  GAl^_^. 

The  simple  linear  regression  is 

GALt  =  1053.706  +  0.53  13058  GAL^_-^  (R^  =  0.2156). 

The  population  skewness  of  GALf_j  is  0.987,  and  the 
kurtosis  is  1.832. 

Results  and  Discussion 

In  a  study  of  this  type,  the  amount  of  results  gener- 
ated easily  overwhelm  one's  ability  to  absorb  what  is 
going  on.  Hence  we  used  a  sequential  approach.  We 
evaluated  estimator  bias  as  a  percentage  of  the  true  total, 
where  less  than  0.20%  bias  (<  0.20%)  is  considered  no 
bias,  0.20-1.00%  is  small  bias,  1-5%  is  serious,  and 
>  5%  is  very  serious  bias.  For  variance  estimation,  bias 
of  less  than  5%  (of  the  iterated  variance)  is  considered 
no  bias,  5-10%  is  small  bias,  10-20%  is  moderate  bias, 
20-50%  is  serious  bias,  and  >50%  is  very  serious  bias. 
For  all  populations,  the  results  for  10,000  iterations  are 
given,  except  that  only  2,000  bootstrap  samples  could 
be  run.  There  was  little  or  no  change  in  results  from 
2,000  to  10,000  iterations,  indicating  that  all  average  es- 
timates had  become  quite  stable. 

Estimation  Bias  (table  1) 

If  an  unbiased  estimator  is  wanted  across  all  pop- 
ulations, clearly  only  pps  and  spps  with  YHT  satisfy  this 
requirement.  Only  YPI  with  spps,  YLR  ( =  YPI)  with  SRS 
and  RSRS;  and  YBLU,  YWR,  and  YPI  with  spscx  have 
essentially  no  estimation  bias  across  all  eight  popula- 
tions. 

If  only  the  populations  are  considered  where  lineari- 
ty is  a  reasonable  assumption  (populations  1-6),  YBLU 
and  YWR  are  essentially  unbiased  across  all  six  popu- 
lations for  all  methods  except  pscx.  Both  estimators 
show  no  bias  for  spps.  For  these  six  populations,  all  SRS 
and  RSRS  regression  estimators  show  small  or  no  bias 
as  does  YPI  for  spscx.  For  populations  7  and  8,  where 
the  assumption  of  linearity  is  less  reasonable,  no  unbi- 
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Table  1 . — Bias  expressed  as  percent  of  the  true  total  for  the  seven  classical  estimators  (mean  square 
errors  in  parentheses)  with  the  six  sample  selection  methods  with  n  =  20  and  10,000  iterations 
from  each  population  (2,000  iterations  for  bootstrap  variance  estimates)^ 


Population 


Sampling 
design 

YRM  YHT 

Y  Ln 

Y  DLU 

vdd3 
Y  rn 

V\A/D 
Y  vVn 

VD|4 
Y  rl 

Simulated  loblolly  pine,  volume  (MSE  * 

lU  ),  vi  = 

1  liSO 

SRS 

-0.56 

-0.14 

-0.13 

-0.13 

-0.19 

-0.14 

(1.24) 

(1.26) 

(1.13) 

(1.13) 

(1.07) 

(1.26) 

RSRS 

-0.03 

0.01 

0.01 

0.01 

-0.09 

0.01 

(1.06) 

(1.06) 

(1.05) 

(1.05) 

(0.99) 

(1.06) 

pps 

0.02 

-0.31 

-0.04 

0.10 

-0.01 

-0.05 

(0.80) 

(2.11) 

(0.80) 

(0.89) 

(0.79) 

(1.81) 

spps 

-0.02 

-0.13 

-0.03 

0.05 

-0.02 

-0.03 

(0.67) 

(0.89) 

(0.67) 

(0.76) 

(0.66) 

(0.67) 

spscx 

-0.37 

-0.05 

0.11 

-0.03 

-0.05 

(1.98) 

(0.69) 

(0.79) 

(0.66) 

(0.68) 

pscx 

-1.49 

0.07 

0.07 

-0.11 

(2.00) 

(0.73) 

(0.73) 

(0.64) 

Loblolly  pine, 

volume  (MSE  *  10^),  N  = 

1  oUI 

SRS"* 

-0.62 

-0.49 

-0.20 

-0.20 

-0.58 

-0.49 

(1.31) 

(1.01) 

(1.12) 

(1.12) 

(0.77) 

(1.01) 

RSRS" 

-0.90 

-0.86 

-0.87 

-0.87 

-0.96 

-0.86 

(0.83) 

(0.84) 

(0.83) 

(0.83) 

(0.69) 

(0.84) 

pps 

0.04 

-6.16 

-0.19 

1.67 

0.08 

-0.34 

(0.58) 

(6.33) 

(0.53) 

(1 .36) 

(0.58) 

(0.64) 

spps 

-0.03 

-1.46 

-0.05 

1.11 

0.11 

-0.05 

(0.56) 

(0.70) 

(0.56) 

(1.40) 

(0.60) 

(0.56) 

spscx 

-6.29 

-0.00 

1.92 

0.25 

-0.01 

(4.88) 

(0.43) 

(1.16) 

(0.46) 

(0.43) 

pscx 

-4.44 

0.56 

0.56 

-0.44 

(2.28) 

(0.62) 

(0.62) 

(0.42) 

Pinus  radiata,  volume  (MSE  *  10^),  N  = 

SRS 

-0.42 

0.20 

-0.01 

-0.01 

0.06 

0.20 

(7.30) 

(7.00) 

(6.47) 

(6.47) 

(5.90) 

(7.00) 

RSRS 

0.34 

0.42 

0.41 

0.41 

0.42 

0.42 

(6.11) 

(6.15) 

(6.05) 

(6.05) 

(5.61) 

(6.15) 

pps 

-0.00 

1.87 

-0.05 

-0.56 

-0.05 

0.12 

(5.34) 

(10.84) 

(4.67) 

(5.41) 

(4.70) 

(4.83) 

spps 

-0.01 

0.56 

-0.01 

-0.41 

-0.06 

-0.01 

(4.38) 

(4.82) 

(4.37) 

(5.15) 

(4.40) 

(4.38) 

spscx 

2.06 

-0.03 

-0.75 

-0.14 

-0.03 

(9.33) 

(4.15) 

(4.91) 

(4.13) 

(4.14) 

pscx 

3.69 

-1.69 

-1.69 

-0.54 

(11.73) 

(5.59) 

(5.59) 

(4.22) 

Southern,  remeasured  (MSE  *  10^),  N 

SRS 

0.09 

-0.26 

-0.24 

-0.24 

-0.55 

-0.26 

(8.19) 

(8.28) 

(8.36) 

(8.36) 

(8.56) 

(8.28) 

RSRS 

-0.08 

-0.16 

-0.18 

-0.18 

-0.46 

-0.16 

(8.16) 

(8.17) 

(8.14) 

(8.14) 

(8.12) 

(8.17) 

pps 

0.19 

-1.15 

-0.08 

0.33 

0.03 

-0.13 

(10,24) 

(9.45) 

(9.13) 

(10.27) 

(9.38) 

(9.03) 

spps 

-0.05 

-0.50 

-0.14 

-0.13 

-0.09 

-0.13 

(7.83) 

(7.15) 

(7.31) 

(8.11) 

(7.28) 

(7.39) 

spscx 

-0.76 

-0.09 

0.25 

-0.03 

-0.10 

(8.28) 

(8.50) 

(10.10) 

(8.75) 

(8.52) 

pscx 

-0.57 

-2.33 

-2.33 

-1.94 

(7.33) 

(7.28) 

(7.28) 

(6.91) 

Pine  Creek,  diameter  (MSE  *  10^),  N  = 

2748 

SRS 

0.05 

n  1 7 

U.  1  / 

0  14 

0.14 

0.22 

0.17 

(6.77) 

(6.52) 

(6.70) 

(6.70) 

(6.72) 

(6.52) 

RSRS 

0.11 

0.10 

0.11 

0.11 

0.17 

0.10 

(6.31) 

(6.24) 

(6.30) 

(6.30) 

(6.37) 

(6.24) 

pps 

0.01 

0.42 

0.11 

-0.06 

0.07 

0.11 

(6.93) 

(7.36) 

(7.13) 

(7.31) 

(7.14) 

(7.10) 

spps 

0.03 

0.16 

0.05 

-0.10 

-0.02 

-0.05 

(6.13) 

(6.12) 

(6.10) 

(6.21) 

(6.12) 

(6.09) 

spscx 

0.33 

-0.01 

-0.19 

-0.05 

-0.01 

(6.80) 

(6.60) 

(6.71) 

(6.59) 

(6.59) 

pscx 

0.39 

-0.09 

-0.09 

0.03 

(6.85) 

(6.69) 

(6.69) 

(6.62) 
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Table  1. — Continued 


Population 


Sampling 

design  YRM 

YHT 

YLR2 

YBLU 

YPR3 

YWR 

YPI" 

6. 

Alabama,  remeasured  (MSE  *  10^^), 

N  =  1905 

SRS  1.20 

0.26 

0.14 

0.14 

0.60 

0.26 

(1.04) 

(0.60) 

(0.69) 

(0.69) 

(0.86) 

(0.60) 

RSRS  0.40 

0.16 

0.16 

0.16 

0.64 

0.16 

(0.62) 

(0.57) 

(0.58) 

(0.58) 

(0.71) 

(0.57) 

pps 

0.22 

1.81 

0.59 

-0.02 

0.44 

0.57 

(1.88) 

(0.84) 

(0.89) 

(1.08) 

(0.92) 

(0.87) 

spps 

0.05 

0.77 

0.13 

-0.45 

0.14 

0.12 

(0.80) 

(0.58) 

(0.59) 

(0.72) 

(0.60) 

(0.60) 

spscx 

1.47 

0.14 

-0.71 

-0.09 

0.12 

(0.75) 

(0.82) 

(1.16) 

(0.88) 

(0.82) 

pscx 

1.39 

0.17 

0.17 

0.58 

(0.78) 

(1.30) 

(1.30) 

(1.00) 

7. 

Pinus  radiata,  bark  (MSE  * 

lO"*),  N  = 

5142 

SRS  2.75 

-0.58 

1.24 

1.24 

9.52 

-0.58 

(5.27) 

(0.81) 

(3.49) 

(3.49) 

(17.46) 

(0.81) 

RSRS  -0.09 

-0.36 

-0.19 

-0.19 

8.61 

-0.36 

(0.87) 

(0.71) 

(0.79) 

(0.79) 

(11.29) 

(0.71) 

pps 

0.02 

-8.20 

-0.04 

1.90 

0.27 

-1.07 

(57.59) 

(3.79) 

(2,25) 

(2.93) 

(2.34) 

(2.23) 

spps 

0.44 

-1.95 

1.44 

2.39 

1.58 

0.77 

(38.36) 

(0.94) 

(0.90) 

(1.64) 

(0.90) 

(0.86) 

spscx 

-7.89 

0.22 

2.05 

0.30 

0.20 

(2.97) 

(1.85) 

(2.91) 

(1.92) 

(1.85) 

pscx 

-7.14 

6.52 

6.52 

3.57 

(2.69) 

(4.46) 

(4.46) 

(2.59) 

8. 

New  York,  remeasured  (MSE  *  10"), 

N  =  622 

SRS  1.63 

0.95 

0.79 

0.79 

3.50 

0.95 

(0.26) 

(0.14) 

(0.17) 

(0.17) 

(0.23) 

(0.14) 

RSRS  1.23 

0.96 

1.01 

1.01 

3.47 

0.96 

(0.14) 

(0.13) 

(0.14) 

(0.14) 

(0.19) 

(0.13) 

pps 

-0.11 

8.58 

2.43 

0.35 

1.91 

2.67 

(0.46) 

(0.29) 

(0.23) 

(0.28) 

(0.24) 

(0.23) 

spps 

-0.01 

3.48 

0.91 

-0.89 

0.77 

0.92 

(0.19) 

(0.14) 

(0.15) 

(0.19) 

(0.15) 

(0.15) 

spscx 

7.41 

0.38 

-2.62 

-0.47 

0.33 

(0.24) 

(0.21) 

(0.30) 

(0.23) 

(0.21) 

pscx 

8.87 

3.47 

3.47 

4.57 

(0.26) 

(0.35) 

(0.35) 

(0.29) 

Estimation  bias  rating:  <0.20%,  no  bias:  0.20-1.00%,  small  bias;  1 

-5%,  serious  bias;  >  5%,  very 

serious  bias. 
^YLR  =  YPI  for  SRS  and  RSRS. 
^YBLU  =  YPR  for  SRS  and  RSRS. 
"^A/of  appropriate  for  pscx. 

ased  estimators  were  found  that  were  not  also  essential- 
ly unbiased  for  the  linear  populations. 

Mean  Square  Error  (tables  1  and  2) 

In  terms  of  mean  square  error,  SRS  and  RSRS  show 
a  remarkably  consistent  pattern  for  the  eight  popula- 
tions. With  all  estimators  RSRS  basically  has  lower  mean 
square  error  than  SRS,  in  some  cases  considerably  low- 
er (YRM  and  YLR  in  population  2,  YRM  in  population 
3,  and  YRM,  YBLU.  YPR.  and  YWR  in  population  7). 
In  populations  5.  6,  7,  and  8  YLR  is  the  most  efficient 
estimator.  The  other  regression  estimators  show  very 
little  difference  in  performance  for  each  population  ex- 
cept that  YWR  is  much  less  efficient  than  the  others  in 
population  7. 

With  pps  sampling  YBLU,  YWR,  and  YPI  tend  to  be 
the  most  efficient  estimators  throughout.  YHT  is  equal- 
ly efficient  in  populations  1,2,  and  5.  It  is  much  less 


efficient  in  populations  3,  4.  6.  8,  and  especially  7.  YLR 
is  much  less  efficient  than  YBLU,  YWR.  and  YPI  in 
populations  1,  2,  and  3,  and  somewhat  less  efficient 
generally  in  the  other  populations.  YPR  is  less  efficient 
than  YBLU.  YPI,  and  YWR  in  populations  2,  3,  4,  6.  and 
7  and  about  equally  efficient  in  the  other  populations. 

With  spps  sampling  YBLU,  YWR,  and  YPI  tend  to  be 
consistently  the  most  efficient  estimators  and  show  very 
little  difference  among  them.  YLR  is  not  efficient  in 
populations  1.2.  and  3.  YPR  is  inefficient  in  all  popu- 
lations. YHT  is  efficient  in  populations  1,  2,  3,  and  4 
but  inefficient  in  5.  6.  8.  and  especially  in  7. 

With  spscx  sampling  YBLU,  YPI,  and  YWR  consistent- 
ly give  the  most  efficient  estimators  and  there  is  little 
difference  between  the  three  estimators.  YLR  is  quite  in- 
efficient for  populations  1,  2,  3,  and  7.  YPR  is  generally 
inefficient  in  all  populations. 

With  pscx  sampling  YWR  is  consistently  the  most  ef- 
ficient estimator  except  in  populations  6  and  8  where 
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Table  2.  Best  mean  square  error  sampling  strategies  for  each  population.  (The  values  in  parentheses 
have  already  been  scaled  for  each  population.) 


Population 


1 . 

pscx,  YWR 

spscx,  YWR 

spps,  YWR 

(0.64) 

(0.66) 

(0.66) 

2. 

pscx,  YWR 

spscx,  YPI 

spscx,  YBLU 

(0.42) 

(0.43) 

(0.43) 

o 

o. 

SpSCS,  YWH 

spscx, YPI 

spscx,  YBLU 

(4.13) 

(4.14) 

(4.15) 

4. 

pscx,  YWR 

spps,  YLR 

spps, YWR 

(6.91) 

(7.15) 

(7.28) 

5. 

spps,  YPI 

spps,  YBLU 

spps,  YLR 

(6.09) 

(6.10) 

(6.12) 

6. 

RSRS,  YLR 

RSRS,  YBLU 

spps,  YLR 

(0.57) 

(0.58) 

(0.58) 

7. 

RSRS,  YLR 

RSRS,  YBLU 

SRS,  YLR 

(0.71) 

(0.79) 

(0.81) 

8. 

RSRS,  YLR 

RSRS,  YBLU 

RSRS,  YPR 

(0.13) 

(0.14) 

(0.14) 

YLR  is  clearly  best.  YLR  is  quite  inefficient  in  popula- 
tions 1,2,3,5,  and  to  a  lesser  extent  in  4.  YBLU  ( =  YPR 
for  pscx)  is  only  efficient  in  population  5. 

The  most  efficient  sampling  strategies  for  each  popu- 
lation are  shown  in  table  2.  The  most  efficient  sampling 
designs  generally  are  spps,  spscx,  and  pscx  for  the  popu- 
lations with  strong  linear  relationships  and  heterogene- 
ous error  structure  (1-4).  There  is  no  estimator  that  is 
always  best  for  these  three  sample  selection  methods  but 
YWR  is  quite  good  in  many  instances,  spps  is  clearly 
the  best  sample  selection  method  with  any  regression 
estimator  in  population  5.  In  population  6  with  a 
homogeneous  error  structure  and  populations  7  and  8 
with  weak  linear  relationships,  RSRS  is  best  with  the 
simple  linear  regression  estimator  YLR,  closely  followed 
by  RSRS  with  YBLU. 

Conclusions  on  mean  square  error  are  as  follows: 

1 .  RSRS  always  has  lower  MSE  than  SRS  for  most  esti- 
mators. 

2.  There  is  no  consistently  best  sampling  strategy  for 
all  populations,  even  for  all  populations  with  a  strong 
linear  y  -  x  relationship. 

3.  Generally  spps,  spscx,  and  pscx  with  estimator  YWR 
are  good  sampling  strategies  for  the  linear  popula- 
tions with  heterogeneous  error. 

4.  spps  with  any  of  the  regression  estimators  and  to  a 
somewhat  lesser  extent  RSRS  are  quite  efficient  for 
the  linear  populations  with  homogeneous  error 
structure. 

5.  RSRS  with  YLR  or  YBLU  is  very  efficient  for  the 
populations  with  weaker  y  -  x  relationships. 

Variance  Estimation  Bias  (table  3) 

The  variance  estimation  biases  for  classical,  jackknife, 
and  bootstrap  variance  estimators  are  given  in  table  3 
for  the  eight  populations. 

Variance  estimation  bias  can  be  summarized  as 
follows. 


spps,  YHT 

spps,  YBLU 

spscx,  YPI 

(0.67) 

(0.67) 

(0.68) 

spscx,  YWR 

pps,  YBLU 

spps,  YHT 

(0.46) 

(0.53) 

(0.56) 

pscx,  YWR 

spps,  YBLU 

spps,  YPI 

(4.22) 

(4.37) 

(4.38) 

pscx,  YBLU 

spps,  YBLU 

pscx,  YLR 

(7.28) 

(7.31) 

(7.33) 

spps,  YWR 

spps,  YHT 

spps,  YPR 

(6.12) 

(6.13) 

(6.21) 

spps,  YBLU 

spps,  YWR 

spps,  YPI 

(0.59) 

(0.60) 

(0.60) 

spps,  YPI 

spps,  YWR 

spps,  YBLU 

(0.86) 

(0.90) 

(0.90) 

SRS,  YLR 

spps,  YLR 

spps,  YBLU 

YWR,  YPI 

(0.14) 

(0.14) 

(0.15) 

1.  None  of  the  three  variance  estimators  (classical,  jack- 
knife,  bootstrap)  dominates  the  others  in  terms  of 
variance  estimator  bias  for  all  eight  populations,  for 
all  six  sampling  designs,  or  for  all  seven  estimators 
considered. 

2.  In  general,  the  bootstrap  variance  estimator  tends  to 
be  the  least  biased  for  the  cases  studied.  But  the  boot- 
strap variance  estimator  can  be  very  seriously  biased 
for  YPI  with  pps  and  spscx  for  all  eight  populations. 

3 .  The  classical  variance  estimator  of  YWR  has  no  bias 
for  population  1  as  expected.  But  for  the  other  popu- 
lations the  bootstrap  variance  estimator  tends  to  be 
the  least  biased. 

4.  The  classical  and  the  jackknife  variance  estimators 
of  YHT  for  pps  have  small  biases  for  all  eight  popu- 
lations. 

5.  Sample  selection  methods  spps,  spscx,  and  pscx  with 
estimator  YWR  tend  to  have  the  lowest  mean  square 
error.  For  these  the  following  is  true  for  variance 
estimation: 

a.  spps  with  YWR  tends  to  have  reliable  bootstrap 
variance  estimates  (<8.58%  for  all  8  popu- 
lations). 

b.  spscx  with  YWR  tends  to  have  less  reliable  boot- 
strap variance  estimates  ( <  20.89%  for  all  8  popu- 
lations). 

c.  pscx  with  YWR  tends  to  have  even  less  reliable 
bootstrap  variance  estimates  (< 32.08%  for  all  8 
populations). 

6.  Variance  estimation  with  RSRS  tends  to  be  less  biased 
than  with  SRS  but  sometimes  is  drastically  worse. 

Coverage  Probability  of  95%  Confidence  Interval 
(table  4) 

As  indicated  earlier,  jackknife  variance  estimates  con- 
sistently overestimate  the  true  variance  of  the  estimat- 
ed total.  This  explains  why  using  the  jackknife  variance 
estimates,  43%  of  the  time  (113/264  cases)  the  coverage 
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Table  3. — Bias  of  classical,  jackknife,  and  bootstrap  variance  estimators  expressed  as  percent  of  the 
iterated  variance  for  the  seven  classical  estimators  with  the  six  sample  selection  methods  with  n  =  20 
and  10,000  iterations  from  each  population  (2,000  iterations  for  bootstrap  variance  estimates).^ 


Population 
Sampling 

design  YRM  YHT         YLR^  YBLU  YPR^        YWR  YPl" 


1.  Simulated  loblolly  pine,  volume  N  =  1795 


SRS  -6.87 

-30.19 

{-23.4}3 

(7.42)^ 

(27.17) 

[-12.88]  5 

[-9.73] 

RSRS  7.62 

-18.40 

{-19.2}^ 

(23.99) 

25.54 

[9.36] 

[-3.75] 

pps 

1 .56 

42.92 

(0.43) 

(19.16) 

[-42.35] 

[4.24] 

spps 

3.48 

98.41 

(62.35) 

(13.39) 

[5.79] 

[12.40] 

spscx 

49.73 

(15.89) 

[6.08] 

pscx 

48.45 

(17.63) 

[8.84] 

Dblolly  pine,  volume  N  = 

1801 

SRS  -20.06 

-22.20 

{-5.3} 

(5.23) 

(30.99) 

[19.36] 

[-25.11] 

RSRS  0.69 

-28.98 

{-28.5} 

(19.88) 

(25.34) 

[-2.81] 

[-14.75] 

pps 

4.19 

12.54 

(3.03) 

(25.24) 

[-9.68] 

[-1 1 .05] 

spps 

3.80 

269.44 

(26.11) 

(63.88) 

[1.04] 

[39.38] 

spscx 

68.43 

(87.78) 

[14.93] 

pscx 

1 12.93 

(64.39) 

[32.09] 

nus  radiata,  volume  N  = 

2761 

SRS  -5.07 

-23.32 

{-10.0} 

(4.36) 

(23.10) 

[-9.53] 

[-11.35] 

RSRS            1 1 .69 

-15.74 

{-14.9} 

(21.75) 

(22.64) 

[5.74] 

[-2.65] 

pps 

-0.74 

50.66 

(-1.46) 

(14.34) 

[-29.38] 

[0.50] 

spps 

-0.21 

95.42 

(40.69) 

(15.96) 

[-2.55] 

[9.71] 

spscx 

77.79 

(22.45) 

[9.94] 

pscx 

116.94 

(51.04) 
[36.38] 


-6.93 

-6.93 

-1.16 

-24.96 

(12.79) 

(13.61) 

(9.31) 

(27.17) 

-7 

[_0  SRI 

[-3.  1 OJ 

-7.89 

-7.89 

-1.90 

-18.15 

(12.09) 

(12.94) 

(9.21) 

(25.54) 

[-8.84] 

[-8.85] 

[-2.00] 

[-3.75] 

-0.91 

-1 1 .48 

0.56 

55.14 

(20.90) 

(36.84) 

(24.47) 

(20.11) 

[12.56] 

[-0.17] 

[14.28] 

[492.43] 

0.72 

-12.09 

1 .41 

52.86 

(7.65) 

(15.19) 

(7.74) 

(12.78) 

[4.67] 

[-8.47] 

[5.80] 

[90.52] 

-3.29 

-16.48 

0.01 

56.55 

(13.55) 

(33.60) 

(18.85) 

(16.45) 

[7.71] 

[-5.49] 

[11.10] 

[562.88] 

-1 1 .49 

-1 1 .49 

0.56 

(32.44) 

(35.28) 

(16.79) 

[-1.06] 

[-1.10] 

[4.33] 

-48.66 

-48.66 

-24.30 

-16.80 

(6.86) 

(9.10) 

(10.32) 

(30.99) 

-43.49] 

[-43.39] 

[-19.69] 

[-PS  1 11 

-35.17 

-35.17 

-19.93 

-28.77 

(7.09) 

(8.52) 

(7.33) 

(25.34) 

-26.34] 

[-26.27] 

[-9.46] 

[-14.75] 

34.43 

-42.1 1 

22.81 

1 30.87 

(26.45) 

(25.55) 

(27.94) 

(36.98) 

[29.00] 

[-41 .35] 

[19.07] 

[1022.77] 

-18.51 

-66.1 6 

-24.30 

90.24 

(9.97) 

(11.31) 

(9.58) 

(32.84) 

[-2.41] 

[-58.27] 

[-8.58] 

[93.48] 

35.46 

-41 .59 

26.41 

194.51 

(27.33) 

(68.37) 

(36.83) 

(53.93) 

[27.31] 

[-45.24] 

[17.75] 

[1373.74] 

-25.98 

-25.98 

9.60 

(44.35) 

(49.19) 

(23.15) 

-24.19] 

[-24.02] 

[11.80] 

-18.56 

-18.56 

-10.07 

-18.36 

(11.08) 

(11.83) 

(10.29) 

(23.10) 

-15.60] 

[-15.58] 

[-7.28] 

[-1.34] 

-17.35 

-17.35 

-10.88 

-5.32 

(10.55) 

(11.23) 

(8.97) 

(22.64) 

[-9.41] 

[-9.40] 

[-2.56] 

1-2.65] 

7.08 

-5.93 

6.41 

51.38 

(11.28) 

(17.63) 

(12.69) 

(12.85) 

[3.30] 

[-9.06] 

[2.55] 

[234.05] 

-6.64 

-20.02 

-7.09 

38.77 

(6.76) 

(12.13) 

(6.96) 

(12.81) 

[-5.56] 

[-19.17] 

[-6.23] 

[35.61] 

6.21 

-7.35 

6.77 

55.05 

(9.88) 

(22.12) 

(12.72) 

(13.62) 

[2.38] 

[-11.38] 

[2.72] 

[264.77] 

0.15 

0.15 

15.96 

(39.37) 

(42.02) 

(21 .42) 

[-11.42] 

[-11.37] 

[5.05] 
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Table  3. — Continued 


Population 
Sampling 

design  YRM  YHT         YLR^  YBLU  YPR^        YWR  YPl" 


4.  Southern,  remeasured  N  =  275 


oHb                   1 .61 

-8.20 

60.18 

60.18 

57.95 

-2.76 

(3.41) 

{5.1} 
(10.79) 

(13.08) 

(13.50) 

(1 1 .65) 

(10.78) 

[4  961 

-3  621 

f1  191 

[  1 . 1 

[1  061 

[1  271 

-3  621 

RSRS  -0.15 

-7.49 

53.51 

53.51 

54.62 

-6.51 

{-13.7} 

(2.35) 

(6.57) 

(6.36) 

(6.80) 

(6.37) 

(6.57) 

[8.78] 

[2.43] 

[9.46] 

[9.33] 

[9.57] 

[2.43] 

pps 

5.77 

18.22 

-14.92 

-25.06 

-17.43 

-0.78 

(-1.93) 

(19.10) 

(15.27) 

(19.48) 

(16.32) 

(13.77) 

[-39.79] 

[28.10] 

[154.66] 

[104.67] 

[10.89] 

[85.29] 

spps 

7.80 

29.33 

3.80 

-6.06 

4.23 

7.56 

(84.01) 

(10.76) 

(12.64) 

(18.72) 

(13.49) 

(10.99) 

[3.55] 

[8.56] 

[38.09] 

[22.31] 

[3.41] 

[23.00] 

spscx 

31.79 

-23.54 

-36.24 

-25.85 

-9.99 

(16.18) 

(19.84) 

(28.29) 

(22.79) 

(17.92) 

[14.94] 

[67.44] 

[34.93] 

[-2.68] 

[67.93] 

pscx 

51.63 

-4.64 

-4.64 

-2.15 

(15.36) 

(50.96) 

(52.05) 

(38.25) 

[30.39] 

[33.73] 

[33.83] 

[32.08] 

ine  Creek,  diameter  N 

=  2748 

SRS  2.66 

-6.02 

27.20 

27.20 

27.66 

1 .20 

{35.5} 

(4.11) 

(16.53) 

(15.93) 

(16.03) 

(14.44) 

(16.53) 

l-fi  7fil 
[-D./DJ 

I'  -^^J 

[10.02] 

[10.02] 

[10.33] 

\7  941 
I'  •='^1 

RSRS  6.22 

-3.38 

26.15 

26.15 

25.23 

-1.58 

{-0.24} 

(6.92) 

(10.28) 

(11.30) 

(11.35) 

(9.87) 

(10.28) 

[8.80] 

[12.62] 

[16.02] 

[16.02] 

[15.73] 

[12.62] 

pps 

2.04 

-9.26 

-7.22 

-9.68 

-7.54 

-7.61 

(1.30) 

(6.31) 

(7.51) 

(8.96) 

(7.78) 

(7.18) 

[-19.31] 

[-0.13] 

[12.37] 

[8.56] 

[-2.87] 

[13.01] 

spps 

0.82 

3.07 

10.75 

8.94 

10.38 

1.84 

(24.97) 

(8.26) 

(9.16) 

(11.48) 

(9.59) 

(8.45) 

[-0.671 

[-0.15 

[5.69] 

[4.65] 

[0.62] 

[3.61] 

spscx 

-3.30 

-5.88 

-6.81 

-5.69 

-5.99 

(5.88) 

(8.01) 

(1 1 .63) 

(8.67) 

(7.86) 

[3.04 

[21.64] 

[20.71] 

[3.88] 

[20.25] 

pscx 

-5.10 

-7.92 

-7.92 

-7.04 

(5.68) 

(10.55) 

(10.75) 

(8.39) 

[1.53] 

[1.50] 

[1.50] 

[0.92] 

labama,  remeasured  N 

=  1905 

SRS  2.09 

-7.24 

168.89 

168.89 

116.20 

-1 .26 

{3.7} 

(6.49) 

(12.68) 

(25.24) 

(25.43) 

(18.41) 

(12.68) 

[2.66] 

[-9.42] 

[9.54] 

[9.46] 

[-0.56] 

[-9.42] 

RSRS  57.99 

-3.87 

187.35 

187.35 

136.01 

-3.21 

{-3.7} 

(63.59) 

(9.33) 

(21.08) 

(21.41) 

(18.04) 

(9.33) 

[1.58] 

[-3.54] 

[15.47] 

[15.40] 

[3.93] 

[-3.54] 

pps 

0.13 

-10.49 

-34.35 

-46.41 

-36.78 

-27.85 

(-0.93) 

(8.42) 

(15.83) 

(20.72) 

(16.93) 

(12.56) 

[-68.32] 

[2.68] 

[328.59] 

[246.95] 

[-13.41] 

[54.23] 

spps 

2.86 

12.08 

19.13 

-2.13 

18.79 

-4.60 

(346.01) 

(8.14) 

(17.41) 

(25.73) 

(17.35) 

(11.39) 

[-25.27] 

[-5.33] 

[77.27] 

[46.60] 

[-5.79] 

[13.45] 

spscx 

-1.84 

-39.76 

-57.21 

-43.86 

-33.87 

(9.99) 

(26.07) 

(36.92) 

(29.81) 

(20.40) 

[1.85] 

[55.44] 

[11.50] 

[-15.57] 

[52.48] 

pscx 

-7.91 

-60.75 

-60.75 

-49.07 

(7.27) 

(27.55) 

(29.53) 

(24.89) 

[-0.47] 

[-36.86] 

[-36.81] 

[-21.75] 
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Table  3.— Continued 


Population 


Sampling 
design 

YRM 

YHT 

YLR2 

YBLU 

YPR3 

YWR 

Yp|4 

7.  Pinus  radiata,  bark  N  =  5142 

SRS 

6.12 

^  Q  QQ 

—  1  o.yo 

3246 

98 

OO/IC  QQ 

0<i4D.yo 

662 

93 

—  /.Of 

2089.07 

{-/ .b} 

(19.12) 

(14.74) 

^87 

69) 

/OO  ~7~7\ 

(82.77) 

f41 

56) 

(1 4.  /4) 

[34.16] 

[b.bTJ 

[344 

86] 

1-21 

721 

Ib.blJ 

nono 

oco  on 

—UMO 
f  n  A  \ 

11556 

6 

1  1  DOD.D 

882 

49 

— U.  1  D 

(381.38) 

{-0.4} 

/■i  Q  QO\ 

[\  O.yo) 

(245 

81) 

/o/\  1  '3^^ 

.OU) 

(54 

53) 

(1  0.\50) 

f534  321 

[16.21] 

[1979 

85] 

[-10.98] 

116 

211 

nn<^ 

0.05 

-56.11 

-64 

23 

-7T.67 

-65 

59 

-56.72 

(-0^34) 

(10.35) 

(45 

50) 

(54.13) 

(48 

16) 

(36.84) 

[-97.88] 

[-<£.Jbj 

[82366 

5] 

[DO^U4.Dj 

[-7 

31] 

[0  /  .DDJ 

spps 

0.71 

7.24 

42 

87 

-18.52 

45 

42 

-10.18 

(2089.07) 

(27.64) 

(36 

76) 

(51.27) 

(36 

73) 

(20.90) 

[-72  22 

[12.98] 

[80236 

5] 

[45251.2] 

[2 

79] 

[93.11] 

spscx 

OO.O  I 

-73 

83 

_QO  70 
0^.  /  0 

-74 

84 

-RK  1  T 

-DO.  1  0 

(43.33) 

(47 

96) 

(58.50) 

(53 

34) 

(53.70) 

[35.77] 

[107 

22] 

[39.75] 

[-4 

52] 

[196.86] 

pscx 

-34.55 

-85 

70 

-85.70 

-78 

70 

(45.65) 

(79 

40) 

(83.06) 

(78 

54) 

[38.62] 

[-33 

19] 

[-33.16] 

[-11 

66] 

8.  New  York, 

remeasured  N 

=  622 

SRS 

-1.95 

-6.10 
111  55 

236 

20 

236.20 

171 

01 

-0.27 

(2.83) 

(13.51) 

(27 

91) 

(28.13) 

(27 

16) 

(13.51) 

[-1.82] 

-1  561 

[6 

10] 

15  921 

[1 

85] 

[-1  561 

RSRS 

49.95 

-4.97 

271 

07 

271 .07 

186 

33 

-4.20 

(53.64) 

\  <J  i  I 

(O.J/) 

(23 

26) 

^^l0.oy) 

(20 

59) 

[58.05] 

[-4.71] 

[16 

30] 

[16.01] 

[7 

69] 

[-4.71] 

pps 

0.99 

-22.17 

-37 

38 

-51 .61 

-41 

48 

-32.68 

(-2.26) 

(15.25) 

(22 

37) 

(24.08) 

(22 

55) 

(18.62) 

[-72.16] 

[8.57] 

[530 

91] 

[361.09] 

[-6 

87] 

[58.90] 

spps 

2.22 

12.63 

11 

04 

-15.25 

10 

86 

-10.87 

(407.40) 

(12.38) 

(20 

92) 

(29.24) 

(21 

96) 

(13.98) 

[-18.64] 

[3.12] 

[91 

86] 

[45.22] 

[-2 

59] 

[15.19] 

spscx 

-6.29 

-46 

51 

-62.86 

-51 

45 

-38.82 

(25.58) 

(32 

35) 

(43.65) 

(35 

50) 

(32.49) 

[11.07] 

[42 

34] 

[-3.38] 

[-20 

09] 

[55.10] 

pscx 

-8.35 

-64 

97 

-64.97 

-55 

42 

(17.84) 

(40 

46) 

(42.42) 

(38 

58) 

[9.17] 

[-32 

85] 

[-32.69] 

[-18 

47] 

'^Variance  bias  ratings:  <5%,  no  bias;  5-70%,  small  bias;  10-20%,  moderate  bias;  20-50%,  seri- 
ous bias;  >  50%,  very  serious  bias. 
^Not  appropriate  for  pscx. 

(YfJ  is  only  shown  for  SRS  and  RSRS  in  braces. 
^The  values  in  parentheses  are  the  biases  of  jackknife  variance  estimates. 


^The  values  in  brackets  are  the  biases  of  bootstrap  variance  estimates. 


probabilities  fall  in  the  interval  (0.94,  0.96)  as  opposed 
to  only  26%  for  the  classical  (67/264)  and  33%  for  the 
bootstrap  (86/264).  These  percentages  increase  to  63%, 
46%,  and  49%,  respectively  if  we  consider  the  interval 
>:0.94  to  be  acceptable. 

Coverage  probabilities  can  be  summarized  as  follows. 

With  this  sample  size  (n  =  20)  one  can  expect  unac- 
ceptable coverage  probabilities  for  at  least  some  popu- 
lations with  any  sampling  strategy  because  the  sample 
size  is  so  small  and  several  populations  are  highly 


skewed;  this  is  what  basically  happened.  The  only  ex- 
ceptions are  pps  and  spps  sampling  with  YPI  using  the 
bootstrap  variance  estimator  and  spps  with  YHT  using 
the  jackknife  variance  estimator.  Our  principal  interest 
with  coverage  probabilities  is  to  see  how  sampling  strate- 
gies performed  that  had  either  low  estimation  bias  and/or 
low  mean  square  error  and/or  low  variance  estimation 
bias. 

Both  pps  and  spps  with  YHT  provide  unbiased  esti- 
mates of  the  population  total  and  essentially  unbiased 
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Table  4  — Percentage  of  coverage  probability  of  95%  confidence  interval  that  includes  true  popula- 
tion total  for  the  seven  classical,  jackknife,  and  bootstrap  estimators  with  the  six  sample  selection 
methods  with  n  =  20  and  10,000  iterations  from  each  population  (2,000  iterations  for  bootstrapping). 


Population 
Sampling 

design  YRM  YHT         YLR^         YBLU         YPR^          YWR  YPP 


1.  Simulated  loblolly  pine,  volume  N  =  1795 


SRS  92.1 

84.6 

94.1 

94.1 

94.8 

86.3 

(93.0)^ 

(92.3) 

(93.2) 

(93.2) 

(93.9) 

(92.3) 

[Qfl  41 3 

t90  41 

r92  11 

[92  11 

92  11 

[90  41 

RSRS  93.9 

89.5 

94.0 

94.0 

94.8 

89.5 

(94.5) 

(92.8) 

(93.2) 

(93.2) 

(94.1) 

(92.8) 

[93.9] 

[91.5] 

[91.5] 

[91.5] 

[92.7] 

[91.5] 

pps 

95.0 

97.9 

94.5 

93.4 

94.8 

97.8 

(94.9) 

(97.6) 

(95.0) 

(94.4) 

(94.9) 

(95.3) 

[86.0] 

[96.0] 

[94.2] 

[92.9] 

[93.9] 

[99.8] 

spps 

93.9 

98.9 

95.0 

93.6 

95.0 

97.5 

(98.2) 

(96.5) 

(95.3) 

(94.6) 

(95.2) 

(95.6) 

[93.2] 

[96.1] 

[94.3] 

[92.8] 

[94.3] 

[98.5] 

spscx 

98.4 

94.8 

93.1 

95.2 

98.3 

(97.8) 

(95,8) 

(95.1) 

(95.9) 

(96.1) 

[96.2] 

[94.7] 

[93.3] 

[94.9] 

[100.0] 

pscx 

98.4 

93.5 

93.5 

94.9 

(97.8) 

(94.7) 

(94.8) 

(95.3) 

[97.2] 

[93.0] 

[93.1] 

[95.3] 

Dblolly  pine,  volume  N  = 

1801 

SRS  89.7 

82.4 

86.6 

86.6 

91.3 

84.1 

(90.8) 

(92.1) 

(92.9) 

(93.0) 

(93.5) 

(92.1) 

[oS.yj 

rQQ  A] 
[00.4J 

tQ-7  Ql 

[Of  .a\ 

[yi.uj 

[oo.h\ 

RSRS  91.6 

86.5 

88.8 

88.8 

91.1 

86.5 

(92.4) 

(91.5) 

(92.3) 

(92.3) 

(92.9) 

(91.5) 

[92.2] 

[90.7] 

[89.9] 

[89.9] 

[91.7] 

[90.7] 

pps 

95.3 

97.3 

96.9 

86.8 

96.4 

98.7 

(95.1) 

(98.2) 

(95.7) 

(95.7) 

(95.8) 

(96,5) 

[91.9] 

[96.9] 

[94.8] 

[86.7] 

[94.8] 

[99.9] 

spps 

92.2 

99.3 

92.6 

79.4 

82.1 

97.7 

(97.5) 

(96.5) 

(95.2) 

(94.8) 

(95.2) 

(95.9) 

[89.8] 

[96.2] 

[93.7] 

[83.1] 

[93.5] 

[97.7] 

spscx 

99.5 

96.9 

88.0 

96.6 

99.4 

(99.1) 

(96.5) 

(97.7) 

(96.9) 

(97,1) 

[97.6] 

[95.6] 

[86.0] 

[95.4] 

[100.0] 

pscx 

99.0 

92.7 

92.7 

95.8 

(97.6) 

(97.0) 

(97.2) 

(95.8) 

[96.3] 

[92.0] 

[92.2] 

[94.6] 

3.  Pinus  radiata,  volume  N  =  2761 


SRS  93.6 

91.4 

96.2 

96.2 

96,3 

92.2 

(93.7) 

(93.2) 

(94,1) 

(94.1) 

(94,4) 

(93.2) 

[92.9] 

[92.8] 

[93.4] 

[93.3] 

[93,8] 

[92.8] 

RSRS  93.3 

91.9 

95.9 

95.9 

96,1 

92.0 

(93.3) 

(92.9) 

(93.3) 

(93.3) 

(93,9) 

(92.9) 

[94.7] 

[92.5] 

[93.5] 

[93.5] 

[93,5] 

[92.5] 

pps 

92.9 

93.5 

91.1 

89.8 

90.7 

92.8 

(92.1) 

(94.2) 

(92.4) 

(92.1) 

(92.4) 

(92.8) 

[90.2] 

[94.4] 

[96.3] 

[96.0] 

[93.2] 

[97.5] 

spps 

92.7 

94.9 

93.5 

93.0 

93.5 

94.0 

(95.3) 

(94.3) 

(94.0) 

(94.1) 

(94.1) 

(94.1) 

[92.4] 

[93.4] 

[94.9] 

[94,2] 

[92.8] 

[94.9] 

spscx 

94.6 

90.1 

87,3 

89.8 

92.4 

(94.8) 

(92.9) 

(92.9) 

(92.9) 

(93.0) 

[93.5] 

[96.8] 

[95.9] 

[92.4] 

[97.7] 

pscx 

95.1 

88,9 

88.9 

89.7 

(94.1) 

(92.0) 

(92.0) 

(92.2) 

[94.8] 

[92.1] 

[92.1] 

[92.6] 

outhern,  remeasured  N 

=  275 

SRS  93.1 

88.2 

92.4 

92.4 

93.6 

89.4 

(93.8) 

(93.9) 

(94.5) 

(94.5) 

(94.7) 

(93.9) 

[94.2] 

[92.6] 

[92.3] 

[92.3] 

[93.8] 

[92.6] 

RSRS  95.6 

91.1 

93.1 

93.1 

93.8 

91.2 

(96.1) 

(94.2) 

(94.6) 

(94.6) 

(94.9) 

(94,2) 

[93.4] 

[93.5] 

[93.4] 

[93.4] 

[94.2] 

[93.5] 
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Table  4.— Continued 


Population 


design  YRM 

YHT 

YLH 

YdLU 

w  n  n  3 
Yrn 

VU/D 

YWH 

YPI 

pps 

95.0 

98.1 

95.6 

93.9 

95.5 

97.6 

(95.0) 

(97.1) 

(95.4) 

(95.1) 

(95.3) 

(95.7) 

[85.8] 

[96.4] 

[94.6] 

[92.6] 

[94.2] 

[99.7] 

spps 

no  o 

fiO  "7 

QO  O 

Q>1  O 

a  /  .o 

(aD.'+j 

/QC  C\ 

/QC  i;\ 

(yo.oj 

/QC  C\ 

(yo.b) 

/QC  Q\ 

(ao.y^ 

[92.0] 

[95.3] 

[93.6] 

[91 .7] 

[93.5] 

[96.9] 

spscx 

yo.y 

yo.y 

yo.y 

QC  7 

yo.  / 

^y  /  .yj 

(yo.y) 

/Qc;  Q\ 
(ao.aj 

(yo.uj 

(ab.^) 

[9b. a] 

[94.4] 

[92.7] 

[94.5] 

[99.8] 

pscx 

yy.u 

yo.  / 

QO  7 

ao.  / 

QC  n 
yb.u 

^Qfl  n\ 

[ziO.U) 

^ycu; 

(ao.! ) 

/QC   1  \ 

(ao.  1 ) 

f flC  "71 

[yb.  /] 

[9^.4] 

[92.4] 

[94.6] 

ine  Creek,  diameter  N 

=  2748 

SRS  94.6 

93.8 

96.0 

96.0 

96.0 

94.8 

(94.7) 

(95.2) 

(95.4) 

(95.3) 

(95.3) 

(95.2) 

[94.0] 

[93.9] 

[94.2] 

[94.2] 

[94.2] 

[93.9] 

RSRS  95.0 

94.1 

96.3 

96.3 

96.3 

94.2 

(95.0) 

(95.0) 

(95.2) 

(95.2) 

(95.3) 

(95.0) 

[94.7] 

[95.1] 

[95.2] 

[95.2] 

[95.4] 

[95.1] 

pps 

93.8 

94.0 

93.1 

92.4 

93.0 

93.4 

/OR  0\ 

^y4.uj 

/QO  R\ 

(y4.uj 

fQA  0\ 

(94.ii) 

rod  "^i 

[a*+.oj 

[QO  Q1 

[ao.aj 

fQO  ftl 

[Q(^  Al 

spps 

93.0 

95.1 

95  2 

(95.9) 

(95.4) 

(95.2) 

(95.1) 

(95.2) 

^s70.0^ 

IQO  HI 

fQ/i  r>\ 

[Q/l  Cl 

[a^.Dj 

ro^  oi 
[yH.oj 

[y4.uj 

[Q/1  Ol 

[y4.v3j 

spscx 

94.4 

93.4 

92.9 

93.5 

93.7 

(95.2) 

(94.8) 

(94.3) 

(94.6) 

^Q4  fll 

IQ4  71 

fQO  Ol 

[ao.oj 

[QC  41 

[at).4j 

pscx 

94.1 

92.7 

92.7 

93  1 

(95.0) 

(94.0) 

(94.0) 

(94.4) 

rod  Ri 

fQO  a\ 

IQT  41 

[QO  71 

[ao.  /  J 

labanna,  remeasured  N 

=  1905 

SRS  95.2 

94.2 

98.2 

98.2 

98.4 

95.0 

(95.4) 

(95.8) 

(96.5) 

(96.5) 

(96.9) 

(95.8) 

[95.7] 

[93.1] 

[94.0] 

[94.0] 

[95.1] 

[93.1] 

RSRS  98.2 

94.6 

98.8 

98.8 

98.6 

84.7 

(98.3) 

(95.4) 

(96.2) 

(96.1) 

(96.4) 

(95.4) 

[97.8] 

[94.1] 

[95.8] 

[95.8] 

[94.9] 

[94,1] 

pps 

88.9 

93.5 

88.9 

85.4 

88.1 

91.1 

(OO./) 

/QO  1  \ 

/QO  0\ 

/QO  Q\ 

/QO  0\ 

(yo,o; 

[/O.UJ 

[Qc  ni 

[yo.'f  J 

[y  1  .oj 

[Q7  Ol 

Spps 

92.8 

96.0 

95.5 

QA.  R 
57H.O 

(98.5) 

(95.8) 

(95.5) 

(95  0) 

[a*+.uj 

[yo.  /  J 

[QQ  Cl 

[yo.Dj 

[QC  /II 

94.2 

87.5 

O  1  ,\J 

OO.O 

ftq  R 
oy  .o 

(95.6) 

(Q9  ftl 

\p  \  .O) 

/QT  ni 

[a^.oj 

rQ7  f^i 
[y  /  .Dj 

[yo.oj 

ron  Ql 

[yu.yj 

[Q7  >1 1 

[9/ .4] 

pscx 

ft 

O  1  .o 

o  1  .O 

(95.5) 

^^7^.  1  ) 

/Q'3  n\ 

ia*+.oj 

FflP  Ol 

ran  t^i 

inus  radiata,  barl<  N  = 

5142 

SRS  94.7 

92.2 

99.8 

99.8 

99.7 

93.5 

(94.7) 

(94.8) 

(97.8) 

(97.9) 

(98.9) 

(94.8) 

[95.2] 

[93.9] 

[96.6] 

[96.6] 

[97.2] 

[93.9] 

RSRS  99.9 

94.6 

100.0 

100,0 

99.9 

94.7 

(99.9) 

(95.8) 

(98.3) 

(98.3) 

(99.3) 

(95.8) 

[100.0] 

[95.5] 

[98.7] 

[98.7] 

[98.5] 

pps 

68.1 

67.3 

79.1 

73.0 

77.9 

83.0 

(68.0) 

(88.0) 

(91.2) 

(89.5) 

(90.8) 

(91.4) 

[37.6] 

[84.7] 

[93.1] 

[92.5] 

[90.2] 

[96.3] 

spps 

79.5 

95.1 

95.7 

90.1 

95.7 

94.2 

(100.0) 

(96.2) 

(95.3) 

(92.7) 

(95.4) 

(95.7) 

[77.7] 

[94.9] 

[97.1] 

[93.0] 

[93.6] 

[98,8] 
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Table  4.— Continued 


Population 
Sampling 

design  YRM  YHT         YLR^         YBLU         YPR^          YWR  YPP 


spscx 

69.9 

71.2 

62.2 

70.1 

78.8 

(88.8) 

(90.1) 

(89.8) 

(90.0) 

(91  0) 

[86.7] 

[99.5] 

[98.7] 

[92.7] 

[99.7] 

pscx 

73.2 

57.1 

57.1 

68.1 

(91.5) 

(89.7) 

(89.8) 

(91.7) 

[89.7] 

[90.4] 

[90.5] 

[93.4] 

ew  York,  remeasured  N  =  622 

SRS 

95.3 

QR  n 

QR  n 

y^.D 

(95.6) 

(ytD.o) 

/QCZ  Q\ 

(yo.o) 

(yb.H) 

(yo.o) 

[94.8] 

iy4.t)j 

[yo.oj 

[yo.oj 

RSRS 

98.0 

94.6 

99.1 

99.1 

99.1 

94.6 

(98.1) 

(95.6) 

(96.6) 

(96.6) 

(97.0) 

(95.6) 

[98.2] 

[94.4] 

[95.7] 

[95.7] 

[96.0] 

[94.4] 

pps 

88.8 

89.1 

88.5 

84.6 

87.8 

90.3 

(88.5) 

(94.6) 

(93.2) 

(92.7) 

(93.1) 

(93.6) 

[79.4] 

[94.0] 

[95.9] 

[95.0] 

[93.2] 

[98.4] 

spps 

92.1 

95.8 

94.7 

91.4 

94.6 

94.0 

(98.3) 

(96.0) 

(95.4) 

(95.0) 

(95.6) 

(95.7) 

[91.7] 

[94.4] 

[95.1] 

[93.5] 

[93.8] 

[95.7] 

spscx 

92.0 

85.5 

76.4 

83.4 

88.6 

(95.4) 

(93.0) 

(93.0) 

(92.9) 

(93.5) 

[93.9] 

[95.3] 

[93.1] 

[90.4] 

[97.8] 

pscx 

89.8 

75.5 

75.5 

80.9 

(93.4) 

(93.6) 

(93.7) 

(93.3) 

[92.3] 

[88.4] 

[88.4] 

[91.1] 

^  YLR  =  YPI  for  SRS  and  RSRS. 

^The  values  in  parentheses  are  the  percentages  of  coverage  probabilities  for  the  jackknife  estimators. 
^The  values  in  brackets  are  the  percentages  of  coverages  for  the  bootstrap  estimators. 


variance  estimates  for  all  populations,  spps  with  YHT 
using  the  jackknife  variance  estimates  provides  cover- 
age probabilities  larger  than  0.95  in  all  populations. 

YLR  is  essentially  unbiased  for  SRS  and  RSRS  in  all 
eight  populations.  Satisfactory  coverage  probabilities  are 
only  found  in  populations  3  and  5-8,  particularly  with 
RSRS,  the  use  of  the  jackknife  variance  estimator  giv- 
ing consistently  the  highest  coverage  probabilities. 

YRM  is  essentially  unbiased  for  SRS  and  RSRS  in  the 
first  six  populations  where  a  linear  relationship  exists 
between  the  variables.  But  satisfactory  coverage  prob- 
abilities are  generally  only  found  in  populations  3  and 
5-8. 

spps  sampling  with  estimator  YWR  tends  to  be  quite 
efficient  and  tends  to  have  reliable  bootstrap  variance 
estimates.  But  with  this  sampling  strategy  the  classical 
and  jackknife  variance  estimates  yield  satisfactory  cover- 
age probabilities  in  basically  all  populations  especially 
the  latter,  spscx  sampling  with  estimator  YWR  is  quite 
efficient  too.  But  coverage  probabilities  tend  to  be  ac- 
ceptable only  in  populations  1,  2,  3,  and  5. 


Summary 

With  the  large  variety  in  populations,  estimators,  sam- 
ple designs,  and  criteria  it  is  difficult  to  identify  sam- 
pling strategies  that  are  clearly  best.  Traditionally  the 
main  criteria  are  negligible  estimation  bias  combined 
with  small  mean  square  error.  In  that  regard  spps  and 
spscx  with  estimator  YWR  are  good  sampling  strategies, 
particularly  for  the  six  populations  with  linear  relation- 
ships, pscx  with  YWR  is  quite  efficient  too  across  popu- 
lations but  suffers  from  estimation  bias  problems  in 
several  cases,  pps  and  spps  with  YHT  have  the  advan- 
tage of  being  unbiased  for  all  populations  as  well  as  hav- 
ing essentially  unbiased  classical  and  jackknife  (for  pps 
only)  variance  estimators  for  all  populations.  But  YHT 
is  quite  inefficient  in  several  populations. 

RSRS  very  consistently  has  lower  mean  square  error 
than  SRS  with  all  estimators.  There  was  no  obvious  pat- 
tern on  whether  SRS  or  RSRS  was  better  in  terms  of  es- 
timation bias. 
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In  terms  of  these  most  efficient  estimators  discussed 
above,  spps  with  YWR  tends  to  have  reliable  bootstrap 
variance  estimates.  Those  for  spscx  w^ith  YWR  tend  to 
be  less  reliable.  Interestingly,  use  of  the  classical  and 
jackknife  variance  estimates  but  not  the  bootstrap  give 
satisfactory  coverage  probabilities  in  most  cases  for  spps 
with  YWR. 
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U.S.  Department  of  Agriculture 
Forest  Service 

Rocky  Mountain  Forest  and 
Range  Experiment  Station 

The  Rocky  Mountain  Station  is  one  of  eight 
regional  experiment  stations,  plus  the  Forest 
Products  Laboratory  and  the  Washington  Office 
Staff,  that  make  up  the  Forest  Service  research 
organization. 

RESEARCH  FOCUS 

Research  programs  at  the  Rocky  Mountain 
Station  are  coordinated  with  area  universities  and 
with  other  institutions.  Many  studies  are 
conducted  on  a  cooperative  basis  to  accelerate 
solutions  to  problems  involving  range,  water, 
wildlife  and  fish  habitat,  human  and  community 
development,  timber,  recreation,  protection,  and 
multiresource  evaluation. 


RESEARCH  LOCATIONS 

Research  Work  Units  of  the  Rocky  Mountain 
Station  are  operated  in  cooperation  with 
universities  in  the  following  cities: 


Albuquerque,  New  Mexico 

Flagstaff,  Arizona 

Fort  Collins,  Colorado* 

Laramie,  Wyoming 

Lincoln,  Nebraska 

Rapid  City,  South  Dakota 

Tempe,  Arizona 


•Station  Headquarters:  240  W.  Prospect  Rd.,  Fort  Collins,  CO  80526 


